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Background Of The Invention 

A common technique for cloning receptors is to use nucleic acid hybridization 
technology to identify receptors which are homologous to other, known receptors. For 
instance, originally the cloning of seven transmembrane domain G protein-coupled receptors 
(GCR) depended on the isolation and sequencing of the corresponding protein or the use of 
expression cloning techniques. However, when sequences for these receptors became 
available, it was apparent that there were significant sequence homologies between these 
receptors. This technology, since it does not require that the ligand of the receptor have 
been identified, has resulted in the cloning of a large number of "orphan receptors", which 
have no known ligand and often whose biological function is obscure. Receptors of all types 
comprise this large family. Known orphan receptors include the nuclear receptors COUP- 
TF1/EAR3, COUP-TF2/ARP 1 , EAR-1, EAR-2, TR-2, PPAR1, HNF-4, ERR-1, ERR-2, 
NGFIB/Nur77, ELP/SF-1 and MPL (Parker et al, supra, and Power et al. (1992) TIBS 
13:318-323). A large number of orphan receptors have been identified in the EPH family 
(Hirai et al (1987) Science 238:1717-1720). HER3 and HER4 are orphan receptors in the 
epidermal growth factor receptor family (Plowman et al. (.1993) Proc. Natl Acad. Set USA 
90:1746-1750). ILA is a newly identified member of the human nerve growth factor/tumor 
necrosis factor receptor family (Schwarz et al. (1993) Gene 134:295-298). IRRR is an 
orphan insulin receptor-related receptor which is a transmembrane tyrosine kinase (Shier et 
al. (1989) J. Biol Chem 264:14606-14608). Several orphan tyrosine kinase receptors have 
been found in Drosophila (Penrimon (1994) Curr. Opin. Cell Biol. 6:260-266). The 
importance of identifying ligands for orphan receptors is clear; it opens up a wide area for 
research in the area of drug discovery. 
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^ One large subgroup of orphan receptors, as alluded to above, are found in the G 

protein coupled receptor family. Approximately 100 such receptors have been identified by 
. function and these mediate transmembrane signaling from external stimuli (vision, taste and 
smell), endocrine function (pituitary and adrenal), exocrine function (pancreas), heart rate, 
lipolysis, and carbohydrate metabolism. Structural and genetic similarities suggest that G 
proteinrcoupled receptor superfamily can be subclassified into five distinct groups: (i) amine 
receptors (serotonin, adrenergic, etc.); (ii) small peptide hormone (somatostatin, TRH, etc.); 
(iii) large peptide hormone (LH-CG, FSH, etc.); (iv) secretin family; and (v) odorant 
1 receptors (Buck L. and Axel, R. (1991) Cell 65:175-187), with orphan receptors apparently 
occurring in each of the sub-families. 

Previous work describes the . expression of recombinant mammalian G 
protein-coupled receptors as a means of studying receptor function as a means of identifying 
agonists and antagonists of those receptors. For example, the human muscarinic receptor 
(HM1) has been functionally expressed in mouse cells (Haipold et al. US Pat. 5,401,629). 
The rat V lb vasopressin receptor has been found to stimulate phosphotidy. inositol hydrolysis 
and intracellular Ca2+ mobilization in Chinese hamster ovary cells upon agonist stimulation 
(Lolait et al. (1995) Proc Natl Acad ScL tiSA 92:6783-6787). These types of ectopic 
expression studies have enabled researchers to study receptor signalling mechanisms and to 
perform mutagenisis studies which have been useful in identifying portions of receptors that 
are critical for ligand binding or signal transduction. 

Experiments have also been undertaken to express functional G protein coupled 
receptors in yeast cells. For example, U.S. Patent 5,482,835 to King et al. describes a 
transformed yeast cell which is incapable of producing a yeast G protein a subunit, but which 
has been engineered to produce both a mammalian G protein a-subunit and a mammalian 
receptor which is "coupled to" (i.e., interacts with) the aforementioned mammalian G protein 
a-subunit. Specifically, U.S. Patent 5,482,835 reports expression of the human beta-2 
adrenergic receptor (p2AR), a seven transmembrane receptor (STR), in yeast, under control 
of the GAL1 promoter, with the P2AR gene modified by replacing the first 63 base pairs of 
coding sequence with 1 1 base pairs of noncoding and 42 base pairs of coding sequence from 
the STE2 gene. (STE2 encodes the yeast a-factor receptor). The Duke researchers found that 
the modified P2AR was functionally integratedlnto the membrane, as shown by studies of 
the ability of isolated membranes to interact properly with various known agonists and 
antagonists of p2AR. The ligand binding affinity for yeast-expressed P2AR was said to be 
nearly identical to that observed for naturally produced P2AR. 

U.S. Patent 5,482,835 describes co-expression of a rat G protein a-subunit in the 
same cells, yeast strain 8C, which lacks the cognate yeast protein. Ligand binding resulted in 
G protein-mediated signal transduction. U.S. Patent 5,482,835 teaches that these cells may 
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be used in screening compounds for the ability to affect the rate of dissociation of Ga from 
GPy in a cell. For this purpose, the cell further contains a pheromone-responsive promoter 
(e.g. BAR1 or FUS1), linked to an indicator gene (e.g. HIS3 or LacZ). The cells are placed in 
multi-titer plates, and different compounds are placed in each well. The colonies are then 
scored for expression of the indicator gene. 

Summary of the Invention 

The present invention relates to a rapid, reliable and effective assay for screening and 
identifying pharmaceutical^ effective compounds that specifically interact with and 
modulate the activity of a cellular receptor or ion channel. The subject assay enables rapid 
screening of large numbers of polypeptides in a library to identifying those polypeptides 
which agonize or antagonize receptor bioactivity. In general, the assay is characterized by 
the use of a library of recombinant cells, each cell of which include (i) a target receptor 
protein whose signal transduction activity can be modulated by interaction with an 
extracellular signal, the transduction activity being able to generate a detectable signal, and 
(ii) an expressible recombinant gene encoding an exogenous test polypeptide from a 
polypeptide library. By the use of a variegated gene library, the mixture of cells collectively 
express a variegated population of test polypeptides. In preferred embodiments, the 
polypeptide library includes at least 10 3 different polypeptides, though more preferably at 
least 10 5 , 10 6 , or 10 7 different (variegated) polypeptides. The polypeptide library can be 
generated as a random peptide library, as a semi-random peptide library (e.g., based on 
combinatorial mutagenesis of a known ligand), or as a cDNA library. 

The ability of particular constituents of the peptide library to modulate the signal 
transduction activity of the target receptor can be scored for by detecting up or down- 
regulation of the detection signal For example, second messenger generation via the 
receptor can be measured directly. Alternatively, the use of a reporter gene can provide a 
convenient readout In any event, a statistically significant change in the detection signal can 
be used to facilitate isolation of those cells from the mixture which contain a nucleic acid 
encoding a test polypeptide which is an effector of the target receptor. 

By this method,~test polypeptides which induce receptor signaling can be identified. 
If the test polypeptide does not appear to directly induce the activity of the receptor protein, 
the assay may be repeated and modified by the introduction of a step in which the 
recombinant cell is first contacted with a known activator of the target receptor to induce the 
signal transdution pathways from the receptor. In one embodiment, the test polypeptide is 
assayed for its ability to antagonize, e.g., inhibit or block the activity of the activator. 
Alternatively, the assay can score for peptides from the peptide library which potentiate the 
induction response generated by treatment of the cell with a known activator. As used 



herein, an "agonist" refers to agents which either induce activation of receptor signalling 
pathways, e.g., such as by mimicking a ligand for the receptor, as well as agents which 
potentiate the sensitivity of the receptor to a ligand, e.g., lower the concentrations of ligand 
required to induce a particular level of receptor-dependent signalling. 

In one embodiment of the present invention the reagent cells express the receptor of 
interest endogenously. In yet other embodiments, the cells are engineered to express a 
heterlogous target receptor protein. In either of these embodiments, it may be desirable to 
inactivate one or more endogenous genes of the host cells. For example, certain preferred 
embodiments in which a heterlogous receptor is provided utilize host cells in which the gene 
for the homologous receptor has been inactivated. Likewise, other proteins involved in 
-transducing signals from the target receptor can be inactivated, or complemented with an 
ortholog or paralog from another organism, e:g., yeast G protein subunits can . be 
complemented by mammalian G protein subunits in yeast cells also engineered to express a 
mammalian G protein coupled receptor. Other complementations include, for example, 
expression of heterologous MAP kinases or erk kinases, MEKs or MKKs (MAP kinase 
kinases), MEKKs (MEK kinases), ras, raf, STATs, JAKs and the like. 

The receptor protein can be any receptor which interacts with an extracellular 
molecule (i.e. hormone, growth factor, peptide) to modulate a signal in the cell. To illustrate 
the receptor can be a cell surface receptor, or in other embodiments can be an intracellular 
receptor. In preferred embodiments, the receptor is a cell surface receptor, such as; a receptor 
tyrosine kinase, e.g., an EPH receptor; an ion channel; a cytokine receptor; an multisubunit 
immune recognition receptor, a chemokine receptor; a growth factor receptor, or a G-protein 
coupled receptor, such as a chemoattracttractant peptide receptor, a neuropeptide receptor, a 
light receptor, a neurotransmitter receptor, or a polypeptide hormone receptor. 

Preferred G protein coupled receptors include alA-adrenergic receptor, alB- 
adrenergic receptor, a2-adrenergic receptor, ct2B-adrenergic receptor, pi -adrenergic 
receptor, p2-adrenergic receptor, p3-adrenergic receptor, ml acetylcholine receptor (AChR), 
m2 AChR, m3 AChR, m4 AChR, m5 AChR, Dl dopamine receptor, D2 dopamine receptor, 
D3 dopamine receptor, D4 dopamine receptor, D5 dopamine receptor, Al adenosine receptor, 
A2b adenosine receptor, 5-HTla receptor, 5-HTlb receptor, 5HTl-like receptor, 5-HTld 
"receptor, 5HTld-like receptor, 5HTld beta receptor, substance K (neurokinin A) receptor, 
fMLP receptor, fMLP-like receptor, angiotensin II type 1 receptor, endothelin ETA receptor, 
endothelin ETB receptor, thrombin receptor, growth hormone-releasing hormone (GHRH) 
receptor, vasoactive intestinal peptide receptor, oxytocin receptor, somatostatin SSTR1 and 
SSTR2, SSTR3, cannabinoid receptor, follicle stimulating hormone (FSH) receptor, leutropin 
(LH/HCG) receptor, thyroid stimulating hormone (TSH) receptor, thromboxane A2 receptor, 
platelet-activating factor (PAF) receptor, C5a anaphylatoxin receptor, Interleukin 8 (IL-8) IL-. 



8RA, IL-8RB, Delta Opioid receptor, Kappa Opioid receptor, mip-l/RANTES receptor, 
Rhodopsin, Red opsin, Green opsin, Blue opsin, metabotropic glutamate mGluRl-6, 
histamine H2 receptor, ATP receptor, neuropeptide Y receptor, amyloid protein precursor 
receptor, insulin-like growth factor II receptor, bradykinin receptor, gonadotropin-releasing 
hormone receptor, cholecystokinin receptor, melanocyte stimulating hormone receptor 
receptor, antidiuretic hormone receptor, glucagon receptor, and adrenocorticotropic hormone 
II receptor. 

Preferred EPH receptors inlcude eph, elk, eck> sek y mek4, hek, hek2, eek 9 erk, tyrol, 
tyro4 y tyro5 y tyro6, tyroll, cek4, cek5, cek6 9 cek7, cek8, cek9, ceklO, bsk, rtkl, rtk2, rtk3, 
mykly myk2, ehkl, ehk2,pagliaccio, htk, erk and nuk receptors. 

As set forth below, no matter which structural/function class to which the target 
receptor may belong, the subject assay is amenable to identifying ligands for an otherwise 
orphan receptor. 

In those embodiments wherein the target receptor is a cell surface receptor, it will be 
desirable for the peptides in the library to express a signal sequence to ensure that they are 
processed in the appropriate secretory pathway and thus are available to interact with 
receptors on the cell surface. 

With respect to a detection signal generated by signal transduction, certain of the 
preferred embodiments measure the production of second messengers to determine changes 
in ligand engagement by the receptor. In preferred embodiments, changes in GTP hydrolysis, 
calcium mobilization, or phospholipid hydrolysis can be measured. 

In other preferred embodiment, the host cells harbors a reporter construct containing a 
reporter gene in operative linkage with one or more transcriptional, regulatory elements 
responsive to the signal transductin activity of the receptor protein. Exemplary reporter 
genes include enzymes, such as luciferase, phosphatase, or P-galactosidase which can 
produce a spectrometrically active label, e.g., changes in color, fluorescence or luminescence, 
or a gene product which alters a cellular phenotype, e.g., cell growth, drug resistance or 
auxotrophy. In preferred embodiments: the reporter gene encodes a gene product selected 
from the group consisting of chloramphenicol acetyl transferase, beta-galactosidase and 
secreted alkaline phosphatase; the reporter gene-encodes a gene product which confers a 
growth signal; the reporter gene encodes a gene product for growth in media containing 
aminotriazole or canavanine. 

The reagent cells of the present invention can be derived from any eukaryotic 
organism. In preferred embodiments the cells are mammalian cells. In more preferred 
embodiments the cells are yeast cells, with cells from the genera Saccharomyces or 
Schizosaccharomyces being more preferred. However, cells from amphibia (such as 
xenopus), avian or insect sources are also contemplated. The host cells can derived from 



primary cells, or transformed and/or immortalized cell lines. 
In another aspect, the present invention provides 

Brief Description of the Drawings 

Figure 1. Structures of pAAH5 and pRS-ADC. 

Figure 2. Schematic diagram of the structure of the plasmid resulting from insertion 
of random oligonucleotides into pADC-MF alpha. This plasmid expresses random peptides 
in the context of the MF alpha 1 signal and leader peptide. 

Figure 3. Schematic diagram of the structure of the plasmid resulting from insertion 
of random oligonucleotides into pADC-MFa. This plasmid expresses random peptides in the 
context of the MFal leader and C-terminal CVIA tetrapeptide. 

Figure 4. Activity of a fusl promoter in response to signaling by human C5a 
expressed in autocrine strains of yeast. 

Figure 5 . Exemplary set of steps for isolating surrogate ligands for the C5a receptor. 

Figure 6. Spotting a lawn of recombinant yeast cells with various C5a receptor 
agonists or DMF solvent control. 

Figure 7. Hie amino acid sequence for C5a surrogate agonist peptides. 

Figure 8. Dose response curve for various C5a receptor surrogate peptide ligands 
based on a colorimetric lacZ readout. 

Figure 9. Expression of a lacZ reporter gene construct, engineered into the 
- mammalian HEK293 cell-Jtne,4n response to stimulation of a C5a receptor by a C5a receptor 
agonist. 

Figure 10. Dose-response curves comparing a second generation C5a receptor 

agonist (122modl-5) with other known C5a receptor agonsts. 

Figure 11. Autocrine activation of the pheromone response pathway in yeast 
expressing FPRL-1 agonists or C5a receptor agonists. 
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Figure 12. Intracellular Ca 44 " mobilization in neutrophils as detected by fluorescence 
activated Cell Sorter analysis using FURA2 dye absorbance ratio. The measurements were 
performed for the C5a peptide, or no peptide (control), or varying concentrations of the A5 
peptide. 

Detailed Description of the Invention 

Proliferation, differentiation and death of eukaryotic cells are controlled by hormones, 
neurotransmitters, and polypeptide factors. These diffusible ligands allow cells to influence 
and be influenced by environmental cues. The study of receptor-ligand interaction has 
revealed a great deal of information about how cells respond to external stimuli, and this 
knowledge has led to the development of therapeutically important compounds. However, 
the rate at which receptors have been cloned has recently increased dramatically — existing 
families have been extended and new families recognized. In particular, the application of 
advanced cloning approaches has allowed the isolation of many receptors for which ligands 
are initially unknown. These are commonly referred to in the art as "orphan" receptors, and 
several have subsequently proved to be important pharmacological targets. 

The present invention makes available a rapid, effective assay for screening and 
identifying pharmaceutical^ effective compounds that specifically interact with and 
modulate the activity of a cellular receptor or ion channel. The subject assay enables rapid 
screening of large numbers of polypeptides in a library to identifying those polypeptides 
which induce or antagonize receptor bioactivity. 

In general, the assay is characterized by the use of a mixture of recombinant cells to 
sample a variegated polypeptide library for receptor agonists or antagonists. As described 
with greater detail below, the reagent cells express both a target receptor protein capable of 
transducing a detectable signal in the reagent cell, and a test polypeptide for which 
interaction with the receptor is to be ascertained. Collectively, a culture of such reagent cells 
will provide a variegated library of potential receptor effectors and those members of the 
library which either agonize or antagonize the receptor function can be selected and identified 
by sequence. 

One salient feature of the subject assay is the enhanced sensitivity resulting from 
expression of the test polypeptide in a cell which also serves as a reporter for the desired 
receptor-ligand interaction. To illustrate, where the detectable signal resulting from receptor 
engagement by an agonist provides a growth signal or drug resistance, individual cells 
expressing polypeptides which agonize receptor function can be amplified and isolated from 
a library culture. 

Accordingly, the present invention provides a convenient format for discovering 
drugs which can be useful to modulate cellular function, as well as to understand the 



pharmacology of compounds that specifically interact with cellular receptors or ion channels. 
Moreover, the subject assay is particularly amenable to identifying ligands, natural or 
artifical, for orphan receptors. 

Before further description of the invention, certain terms employed in the 
specification, examples and appended claims are, for convenience, collected here. 

As used herein, "recombinant cells" include any . cells that have been modified by the 
introduction of heterologous DNA. Control cells include cells that are substantially identical 
to the recombinant cells, but do not express one or more of the proteins encoded by the 
heterologous DNA, e.g., do not include or express the reporter gene construct, receptor or test 
polypeptide. 

The terms "recombinant protein", "heterologous protein" and "exogenous protein" are 
used interchangeably throughout the specification and refer to a polypeptide which is , 
produced by recombinant DNA techniques, wherein generally, DNA encoding the 
polypeptide is inserted into a suitable expression vector which is in turn used to transform a 
host cell to produce the heterologous protein. That is, the polypeptide is expressed from a 
heterologous nucleic acid. 

As used herein, "heterologous DNA" or "heterologous nucleic acid" include DNA 
that does not occur naturally as part of the genome in which it is present or which is found in 
a location or locations in the genome that differs from that in which it occurs in nature. 
Heterologous DNA is not endogenous to the cell into which it is introduced, but has been 
obtained from another cell. Generally, although not necessarily, such DNA encodes RNA 
and proteins that are not normally produced by the cell in which it is expressed. 
Heterologous DNA may also be referred to as foreign DNA. Any DNA that one of skill in 
the art would recognize or consider as heterologous or foreign to the cell in which is 
expressed is herein encompassed by heterologous DNA. Examples of heterologous DNA 
include, but are not limited to, DNA that encodes test polypeptides, receptors, reporter genes, 
transcriptional and translational regulatory sequencespselectable or traceable marker proteins, 
such as a protein that confers drug resistance. 

As used herein, "cell surface receptor" refers to molecules that occur on the surface of 
cells, interact with the extracellular environment, and transmit or transduce the information 
regarding the environment intracellularly in a manner that ultimately modulates transcription 
of specific promoters, resulting in transcription of specific genes. 

As used herein, "extracellular signals" include a molecule or a change in the ' 
environment that is transduced intracellularly via cell surface proteins that interact, directly 
or indirectly, with the signal. An extracellular signal or effector molecule includes any 
compound or substance that in some manner specifically alters the activity of a cell surface 
protein. Examples of such signals include, but are not limited to, molecules such as 



acetylcholine, growth factors and hormones, that bind to cell surface and/or intracellular 
receptors and ion channels and modulate the activity of such receptors and channels. 

As used herein, "extracellular signals" also include as yet unidentified substances that 
modulate the activity of a cellular receptor, and thereby influence intracellular functions. 
Such extracellular signals are potential pharmacological agents that may be used to treat 
specific diseases by modulating the activity of specific cell surface receptors. 

"Orphan receptors" is a designation given to a receptors for which no specific natural 
ligand has been described. 

As used herein, a "reporter gene construct" is a nucleic acid that includes a "reporter 
gene" operatively linked to a transcriptional regulatory sequences. Transcription of the 
reporter gene is controlled by these sequences. The activity of at least one or more of these 
control sequences is directly or indirectly regulated by the target receptor protein. The 
transcriptional regulatory sequences include the promoter and other regulatory regions, such 
as enhancer sequences, that modulate the activity of the promoter, or regulatory sequences 
that modulate the activity or efficiency of the RNA polymerase that recognizes the promoter, 
or regulatory sequences are recognized by effector molecules, including those that are 
specifically induced by interaction of an extracellular signal with the target receptor. For 
example, modulation of the activity of the promoter may be effected by altering the RNA 
polymerase binding to the promoter region, or, alternatively, by interfering with initiation of 
transcription or elongation of the mRNA. Such sequences are herein collectively referred to 
as transcriptional regulatory elements or sequences. In addition, the construct may include 
sequences of nucleotides that alter translation of the resulting mRNA, thereby altering the 
amount of reporter gene product. \ 

"Signal transduction" is the processing of chemical signals from the cellular 
environment through the cell membrane, and may occur through one or more of several 
mechanisms, such as phosphorylation, activation of ion channels, effector enzyme activation 
via guanine nucleotide binding protein intermediates, formation of inositol phosphate, 
activation of adenylyl cyclase, and/or direct activation (or inhibition) of a transcriptional 
factor. 

The term "modulation of a -signal transduction activity of a receptor protein" in its 
various grammatical forms, as used herein, designates induction and/or potentiation, as well 

as inhibition of one or more signal transduction pathways downstream of a receptor. 

Agonists and antagonists are "receptor effector" molecules that modulate signal 
transduction via a receptor. Receptor effector molecules are capable of binding to the 
receptor, though not necessarily at the binding site of the natural ligand. Receptor effectors 
can modulate, signal transduction when used alone, i.e. can be surrogate ligands, or can alter 
signal transduction in the presence ot the natural ligand, either to enhance or inhibit signaling 



by the natural ligand. For example, "antagonists' 1 are molecules that block or decrease the 
signal transduction activity of receptor, e.g., they can competitively, noncompetitively, 
and/or allosterically inhibit signal transduction from the receptor, whereas "agonists" 
potentiate, induce or otherwise enhance the signal transduction activity of a receptor. The 
terms "receptor activator" and "surrogate ligand" refer to an agonist which induces signal 
transduction from a receptor. 

The term "substantially homologous", when used in connection with amino acid 
sequences, refers to sequences which are substantially identical to or similar in sequence, 
giving rise to a homology in conformation and thus to similar biological activity. The term is 
not intended to imply a common evolution of the sequences. 

Typically, "substantially homologous" sequences are at least 50%, more preferably at 
least 80%, identical in sequence, at least over any regions known to be involved in the 
desired activity. Most preferably, no more than five residues, other than at the termini, are 
different. Preferably, the divergence in sequence, at least in the aforementioned regions, is in 
the form of "conservative modifications". 

The term "autocrine cell", as used herein, refers to a cell which produces a substance 
which can stimulate a receptor located on or within the same cell as produces the substance. 
For .example, wild-type yeast a and a cells are not autocrine. However, a yeast cell which 
produces both a-factor and a-factor receptor, or both a-factor and a-factor receptor, in 
functional form, is autocrine. By extension, cells which produce a peptide which is being 
screened for the ability to activate a receptor (e.g., by activating a G protein-coupled 
receptor) express the receptor are called "autocrine cells", though it might be more precise to 
call them "putative autocrine cells". Of course, in a library of such cells, in which a multitude 
of different peptides are produced, it is likely that one or more of the cells will be "autocrine" 
in the stricter sense of the term. 

The terms "protein", "polypeptide" and "peptide" are used interchangeably herein. 
I. Overview of Assay 

As set out above, the present invention relates to methods for identifying effectors of 
a receptor protein or complex thereof. In general, the assay is characterized by the use of a 
library of recombinant cells, each cell of which include (i) a target receptor protein whose 
signal transduction activity can be modulated by interaction with an extracellular signal, the 
transduction activity being able to generate a detectable signal, and (ii) an expressible 
recombinant gene encoding an exogenous test polypeptide from a polypeptide library. By the 
use of a variegated gene library, the mixture of cells collectively express a variegated 
population of test polypeptides. 



The ability of particular constituents of the peptide library to modulate the signal 
transduction activity of the target receptor can be scored for by detecting up or down- 
regulation of the detection signal. For example, second messenger generation (e.g. GTPase 
activity, phospholipid hydrolysis, or protein phosphorylation) via the receptor can be 
measured directly. Alternatively, the use of a reporter gene can provide a convenient readout. 
In any event, a statistically significant change in the detection signal can be used to facilitate 
isolation of those cells from the mixture which contain a nucleic acid encoding a test 
polypeptide which is an effector of the target receptor. 

By this method, test polypeptides which induce the receptor's signaling can be 
screened. If the test polypeptide does not appear to induce the activity of the receptor 
protein, the assay may be repeated and modified by the introduction of a step in which the 
recombinant cell is first contacted with a known activator of the target receptor to induce 
signal transduction from the receptor, and the test polypeptide is assayed for its ability to 
inhibit the activity of the receptor, e.g., to identify receptor antagonists. In yet other 
embodiments, the peptide library can be screened for members which potentiate the response 
to a known activator of the receptor; In this respect, surrogate ligands identified by the 
present assay for orphan receptors can be used as the exogenous activator, and further peptide 
libraries screened for members which potentiate or inhibit the activating peptide. 
Alternatively, the surrogate ligand can be used to screen exogenous compound libraries 
(peptide and non-peptide) whichj by modulating the activity of the identified surrogate, will 
presumably also similarly effect the native ligand's effect on the target receptor. In such 
embodiments, the surrogate ligand can be applied to the cells, though is preferably produced 
by the reagent cell, thereby providing an autocrine cell. 

In developing the recombinant cells assays, it was recognized that a frequent result of 
receptor-mediated responses to extracellular signals was the transcriptional acitivation or 
inactivation of specific genes after exposure of the cognate receptor to an extracellular signal 
that induces such activity. Thus, transcription of genes controlled by receptor-responsive 
transcriptional elements often reflects the activity of the surface protein by virtue of 
transduction of an intracellular signal; 

To illustrate, the intracellular signal that is transduced can be initiated by the specific 
interaction of an extracellular signal, particularly a ligand, with a cell surface receptor on the 
cell. This interaction sets in motion a cascade of intracellular events, the ultimate 
consequence of which is a rapid and detectable change in the transcription or translation of a 
gene. By selecting transcriptional regulatory sequences that are responsive to the transduced 
intracellular signals and operatively linking the selected promoters to reporter genes, whose 
transcription, translation or ultimate activity is readily detectable and measurable, the 
transcription based assay provides a rapid indication of whether a specific receptor or ion 



channel interacts with a test peptide in any way that influences intracellular transduction. 
Expression of the reporter gene, thus, provides a valuable screening tool for the development 
of compounds that act as agonists or antagonists of a cell receptor or ion channel. 

Reporter gene based assays of this invention measure the end stage of the above 
described cascade of events, e.g., transcriptional modulation. Accordingly, in practicing one 
embodiment of the assay, a reporter gene construct is inserted into the reagent cell in order to 
generate a detection signal dependent on receptor signaling. Typically, the reporter gene 
construct will include a reporter gene in operative linkage with one or more transcriptional 
regulatory elements responsive to the signal transduction activity of the target receptor, with 
the level of expression of the reporter gene providing the receptor-dependent detection signal. 
The amount of transcription from the reporter gene may be measured using any method 
known to those of skill in the art to be suitable. For example, specific mRNA expression may 
be detected using Northern blots or specific protein product may be identified by a 
characteristic stain or an intrinsic activity. 

In preferred embodiments, the gene product of the reporter is detected by an intrinsic 
activity associated with that product. For instance, the reporter gene may encode a gene 
product that, by enzymatic activity, gives rise to a detection signal based on color, 
fluorescence, or luminescence. 

The amount of expression from the reporter gene is then compared to the amount of 
expression in either the same cell in the absence of the test compound or it may be compared 
with the amount of transcription in a substantially identical cell that lacks the specific 
receptors. A control cell may be derived from the same cells from which the recombinant cell 
was prepared but which had not been modified by introduction of heterologous DNA, e.g., 
the encoding the, test polypeptide. Alternatively, it may be a cell in which the specific 
receptors are removed. Any statistically or otherwise significant difference in the amount of 
transcription indicates that the test polypeptide has in some manner altered the activityjDf the 
specific receptor. " 

In other preferred embodiments, the reporter or marker gene provides a selection 
method such that cells in which the peptide is a ligand for the receptor have a growth 
advantage. For example the reporter could enhance cell viability, relieve a cell nutritional 
requirement, and/or provide resistance to a drug. 

With respect to the target receptor, it may be endogenously expressed by the host cell, 
or it may be expressed from a heterologous gene that has been introduced into the cell. 
Methods for introducing heterologous DNA into eukaryotic cells are of course well known in 
the art and any such method may be used. In addition, DNA encoding various receptor 
proteins is known to those of skill in the art or it may be cloned by any method known to 
those of skill in the art. In certain embodiments, such as when an exogenous receptor is 



expressed, it may be desirable to inactivate, such as by deletion, a homologous receptor 
present in the cell. 

The subject assay is useful for identifying polypeptides that interact with any receptor 
protein whose activity ultimately induces a signal transduction cascade in the host cell which 
can be exploited to produce a detectable signal. In particular, the assays can be used to test 
functional ligand-receptor or ligand-ion channel interactions for cell surface-localized 
receptors and channels, and also for cytoplasmic and nuclear receptors. As described in more 
detail below, the subject assay can be used to identify effectors of, for example, G protein- 
coupled receptors, receptor tyrosine kinases, cytokine receptors, and ion channels, as well as 
steroid hormone receptors. In preferred embodiments the method described herein is used for 
identifying ligands for "orphan receptors" for which no ligand is known. 

In embodiments in which cell surface receptors are the assay targets, it will be 
desirable for each of the peptides of the peptide library to include a signal sequence for 
secretion, e.g., which will ensure appropriate transport of the peptide to the endoplasmic 
reticulum, the golgi, and ultimately to the cell surface so that it is able. to interact with cell 
surface receptors. In the case of yeast cells, the signal sequence will transport peptides to the 
periplasmic space. 

Any transfectable cell that can express the desired cell surface protein in a manner 
such the protein functions to intracellularly transduce an extracellular signal may be used. 
The cells may be selected such that they endogenously express the target receptor protein or 
may be genetically engineered to do so. 

The preparation of cells which express the orphan FPRL1 receptor, a peptide library, 
and a reporter gene expression construct, are described. These cells have been used to 
identify a novel ligand for this receptor. The cells for the identification of receptor ligands 
and in drug screening assays to discover agents capable of modulating receptor activity. 

Any cell surface protein that is known to those of skill in the art or that may be 
identified by those of skill in the art may used in the assay. The cell surface protein may 
endogenously expressed on the selected cell or it may be expressed from cloned DNA. 

II Host Cells 

Suitable host cells for generating the subject assay include prokaryotes, yeast, or 
higher eukaryotic cells, especially mammalian cells. Prokaryotes include gram negative or 
gram positive organisms. Examples of suitable mammalian host cell lines include the COS-7 
line of monkey kidney cells (ATCC CRL 1651) (Gluzman (1981) Cell 23:175) CV-1 cells 
(ATCC CCL 70), L cells, CI 27, 3T3, Chinese hamster ovary (CHO), HeLa and BHK cell 
lines. 



If yeast cells are used, the yeast may be of any species which are cultivable and in 
which an exogenous receptor can be made to engage the appropriate signal transduction 
machinery of the host cell. Suitable species include Kluyverei lactis, Schizosaccharomyces 
pombe, and Ustilaqo maydis; Saccharomyces cerevisiae is preferred. Other yeast which can 
be used in practicing the present invention are Neurospora crassa, Aspergillus niger, 
Aspergillus nidulans, Pichia pastoris, Candida tropicalis, and Hansenula polymorpha. The 
term "yeast", as used herein, includes not only yeast in a strictly taxonomic sense, i.e., 
unicellular organisms, but also yeast-like multicellular fungi or filamentous fungi. 

The choice of appropriate host cell will also be influenced by the choice of detection 
signal. For instance, reporter constructs, as described below, can provide a selectable or 
screenable trait upon transcriptional activation (or inactivation) in response to a signal 
transduction pathway coupled to the target receptor. The reporter gene may be an 
unmodified gene already in the host cell pathway, such as the genes responsible for growth 
arrest in yeast. It may be a host cell gene that has been operably linked to a 
"receptor-responsive" promoter. Alternatively, it may be a heterologous gene that has been so 
linked. Suitable genes and promoters are discussed below. In other embodiments, second 
messenger generation can be measured directly in the detection step, such as mobilization of 
intracellular calcium or phospholipid metabolism are quantitated. Accordingly, it will be 
understood that to achieve selection or screening, the host cell must have an appropriate 
phenotype. For example, introducing a pheromone-responsive chimeric HIS3 gene into a 
yeast that has a wild-type HIS3 gene would frustrate genetic selection. Thus, to achieve 
nutritional selection, an auxotrophic strain is wanted. 

To further illustrate, in a preferred embodiment of the subject assay using a yeak host 
cell, the yeast cells possess one or more of the following characteristics: (a) the endogenous 
FUS1 gene has been inactivated; (b) the endogenous SST2 gene, and/or other genes involve 
in desensitization, has been inactivated; (c) if there is a homologous, endogenous receptor 
gene it has been inactivated; and (d) if the yeast produces an endogenous ligand to the 
exogenous receptor, the genes encoding for the ligand been inactivated. 

Other complementations for use in the subject assay can be constructed without any 
undue experimentation. Indeed, many yeast genetic complementation with mammalian 
signal transduction proteins have been described in the art. For example, Mosteller et al. 
(1994) Mol Cell Biol 14:1 104-12 demonstrates that human Ras proteins can complement loss 
of ras mutations in S. cerevisiae. Moreover, Toda et al. (1986) Princess Takamatsu Symp 17: 
253-60 have shown that human ras proteins can complement the loss of RAS1 and RAS2 
proteins in yeast, and hence are functionally homologous. Both human and yeast RAS 
proteins can stimulate the magnesium and guanine nucleotide-dependent adenylate cyclase 
activity present in yeast membranes. Ballester et al. (1989) Cell 59: 681-6 describe a vector 



to express the mammalian GAP protein in the yeast S. cerevisiae. When expressed in yeast, 
GAP inhibits the function of the human ras protein, and complements the loss of IRA1 . IRA1 
is a yeast gene that encodes a protein with homology to GAP and acts upstream of RAS. 
Mammalian GAP can therefore function in yeast and interact with yeast RAS. Wei et al. 
(1994) Gene 151: 279-84 describes that a human Ras-specific guanine nucleotide-exchange 
factor, Cdc25GEF, can complement the loss of CDC25 function in S. cerevisiae. Martegani 
et al. (1992) EMBO J 11: 2151-7 describe the cloning by functional complementation of a 
mouse cDNA encoding a homolog of CDC25, a Saccharomyces cerevisiae RAS activator. 
Vojtek et al. (1993) J Cell Sci 105: 777-85 and Matviw et al. (1992) Mol Cell Biol 12: 5033- 
.40 describe how a mouse CAP protein, e.g., an adenylyl cyclase associated protein associated 
with ras-mediated signal transduction, can complements defects in S. cerevisiae. Papasawas 
et al. (1992) Biochem Biophys Res Commun 184:1378-85 also suggest that inactivated yeast 
adenyl cyclase can be complemented by a mammalian adenyl cyclase gene. Hughes et al. 
(1993) Nature 364: 349-52 describe the complementation of byrl in fission yeast by 
mammalian MAP kinase kinase (MEK). Parissenti et al. (1993) Mol Cell Endocrinol 98: 9-16 
describes the reconstitution of bovine protein kinase C (PKC) in yeast. The Ca(2+)- and 
phospholipid-dependent Ser/Thr kinase PKC plays important roles in the transduction of 
cellular signals in mammalian cells. Marcus et al. (1995) PNAS 92: 6180-4 suggests the 
complementation of shkl null mutations in S. pombe by the either the structurally related S. 
cerevisiae Ste20 or mammalian p65P AK protein kinases. 

"Inactivation", with respect to genes of the host cell, means that production of a 
functional gene product is prevented or inhibited. Inactivation may be achieved by deletion of 
the gene, mutation of the promoter so that expression does not occur, or mutation of the 
coding sequence so that the gene product is inactive. Inactivation may be partial or total. 

"Complementation", with respect to genes of the host cell, means that at least partial 
function of inactivated gene of the host cell is supplied by an exogenous nucleic acid. For 
instance, yeast cells can be "mammalianized", and even "humanized", by complementation of 
receptor and signal transduction proteins with mammalian homologs. To illustrate, 
inactivation of a yeast Byr2/Stell gene can be complemented by expression of a human 
MEKKgene. 

HI. Expression Systems 

Ligating a polynucleotide coding sequence into a gene construct, such as an 
expression vector, and transforming or transfecting into hosts, either eukaryotic (yeast, 
avian, insect or mammalian) or prokaryotic (bacterial cells), are standard procedures used in 
producing other well-known proteins, including sequences encoding exogenous receptor and 
peptide libraries. Similar procedures, or modifications thereof, can be employed to prepare 



recombinant reagent cells of the present invention by tissue-culture technology in accord with 
the subject invention. 

In general, it will be desirable that the vector be capable of replication in the host cell. 
It may be a DNA which is integrated into the host genome, and thereafter is replicated as a 
part of the chromosomal DNA, or it may be DNA which replicates autonomously, as in the 
case of a plasmid. In the latter case, the vector will include an origin of replication which is 
functional in the host. In the case of an integrating vector, the vector may include sequences 
which facilitate integration, e.g., sequences homologous to host sequences, or encoding 
integrases. 

Appropriate cloning and expression vectors for use with bacterial, fungal, yeast, and 
mammalian cellular hosts are known in the art, and are described in, for example, Powels et 
al. (Cloning Vectors: A Laboratory Manual, Elsevier, New York, 1985). Mammalian 
expression vectors may comprise non-transcribed elements such as an origin of replication, a 
suitable promoter and enhancer linked to the gene to be expressed, and other 5 f or 3' flanking 
nontranscribed sequences, and 5' or 3' nontranslated sequences, such as necessary ribosome 
binding sites, a poly-adenylation site, splice donor and acceptor sites, and transcriptional 
termination sequences. 

The preferred mammalian expression vectors contain both prokaryotic sequences, to 
facilitate the propagation of the vector in bacteria, and one or more eukaryotic transcription 
units that are expressed in eukaryotic cells. The pcDNAI/amp, pcDNAI/neo, pRc/CMV, 
pSV2gpt, pSV2neo, pSV2-dhfr, pTk2, pRSVneo, pMSG, pSVT7, pko-neo and pHyg derived 
vectors are examples of mammalian expression vectors suitable for transfection of eukaryotic 
cells. Some of these vectors are modified with sequences from bacterial plasmids, such as 
pBR322, to facilitate replication and drug resistance selection in both prokaryotic and 
eukaryotic cells. Alternatively, derivatives of viruses such as the bovine papillomavirus 
(BPV-1), or Epstein-Barr virus (pHEBo, pREP-derived and p205) can be used for transient 
expression of proteins in eukaryotic cells. The various methods employed in the preparation 
of the plasmids and transformation of host organisms are well known in the art. For other 
suitable expression systems for both prokaryotic and eukaryotic cells, as well as general 
recombinant procedures, see Molecular Cloning A Laboratory Manual, 2nd Ed., ed. by 
Sambrook, Fritsch and Maniatis (Cold Spring Harbor Laboratory Press: 1989) Chapters 16 
and 17. 

The transcriptional and translational control sequences in expression vectors to be 
used in transforming mammalian cells may be provided by viral sources. For example, 
commonly used promoters and enhancers are derived from Polyoma, Adenovirus 2, Simian 
Virus 40 (SV40), and human cytomegalovirus. DNA sequences derived from the SV40 viral 
genome, for example, SV40 origin, early and late promoter, enhancer, splice, and 
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polyadenylation sites may be used to provide the other genetic elements required for 
expression of a heterologous DNA sequence. The early and late promoters are particularly 
useful because both are obtained easily from the virus as a fragment which also contains the 
SV40 viral origin of replication (Fiers et al.(1978) Nature 273:1 1 1) Smaller or larger SV40 
fragments may also be used, provided the approximately 250 bp sequence extending from the 
Hind III site toward the Bgl I site located in the viral origin of replication is included. 
Exemplary vectors can be constructed as disclosed by Okayama and Berg (1983, Mol Cell 
Biol. 3:280). A useful system for stable high level expression of mammalian receptor cDNAs 
in CI 27 murine mammary epithelial cells can be constructed substantially as described by 
Cosman et al (1986, Mol Immunol 23:935). Other expression vectors for use in mammalian 
host cells are derived from retroviruses. 

In other embodiments, the use of viral transfection can provide stably integrated 
copies of the expression construct. In particular, the use of retroviral, adenoviral or adeno- 
associated viral vectors is contemplated as a means for providing a stably transfected cell line 
which expresses an exogenous receptor, and/or a polypeptide library. 

A number of vectors exist for the expression of recombinant proteins in yeast. For 
instance, YEP24, YIP5, YEP51, YEP52, pYES2, and YRP17 are cloning, and expression 
vehicles useful in the introduction of genetic constructs into S. cerevisiae (see, for example, 
Broach et al (1983) in Experimental Manipulation of Gene Expression, ed. M. Inouye 
Academic Press, p. 83, incorporated by reference herein). These vectors can replicate in E. 
coli due the presence of the pBR322 ori, and in S. cerevisiae due to the replication 
determinant of the yeast 2 micron plasmid. In addition, drug resistance markers such as 
ampicillin can be used. Moreover, if yeast are used as a host cell, it will be understood that 
the expression of a gene in a yeast cell requires a promoter which is functional in yeast. 
Suitable promoters include the promoters for metallothionein, 3-phosphoglycerate kinase 
(Hitzeman et al., J. Biol Chem. 255, 2073 (1980) or other glycolytic enzymes (Hess et al., 1 
Adv. Enzyme Req. 7^149 (1968); and Holland et al. Biochemistry 17, 4900 (1978)), such as 
enolase, glyceraldehyde-3^phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phospho-fructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, pyruvate 
kinase, triosephosphate isomerase, phospho-glucose isomerase, and glucokinase. Suitable 
vectors and promoters for use in yeast expression are further described in R. Hitzeman et al., 
EPO Publn. No. 73,657. Other promoters, which have the additional advantage of 
transcription controlled by growth conditions, are the promoter regions for alcohol 
dehydrogenase 2, isocytochrome C, acid phosphatase, degradative enzymes associated with 
nitrogen metabolism, and the aforementioned metallothionein and 
glyceraldehyde-3-phosphate dehydrogenase, as well as enzymes responsible for maltose and 
galactose utilization. Finally, promoters that are active in only one of the two haploid mating 
types may be appropriate in certain circumstances. Among these haploid-specific promoters, 



the pheromone promoters MFal and MFal are of particular interest. 

In some instances, it may be desirable to derive the host cell using insect cells. In 
such embodiments, recombinant polypeptides can be expressed by the use of a baculovirus 
expression system. Examples of such baculovirus expression systems include pVL-derived 
vectors (such as pVL1392, pVL1393 and pVL941), pAcUW-derived vectors (such as 
pAcUWl), and pBlueBac-derived vectors (such as the fi-gal containing pBlueBac III). 

Libraries of random peptides or cDNA fragments may be expressed in a multiplicity 
of ways, including as portions of chimeric proteins. As described below, where secretion of 
the peptide library is desired, the peptide library can be engineered for secretion or transport 
to the extracellular space via the yeast pheromone system 

In constructing suitable expression plasmids, the termination sequences associated 
with these genes, or with other genes which are efficiently expressed in yeast, may also be 
ligated into the expression vector 3 ! of the heterologous coding sequences to provide 
polyadenylation and termination of the mRNA. 

IV. Periplasmic Secretion 1 

If yeast cells are used as the host cell it will be noted that the yeast cell is bounded by 
a lipid bilayer called the plasma membrane. Between this plasma membrane and the cell wall 
is the periplasmic space. Peptides secreted by yeast cells cross the plasma membrane through 
a variety of mechanisms and thereby enter the periplasmic space. The secreted peptides are 
then free to interact with other molecules that are present in the periplasm or displayed on the 
outer surface of the plasma membrane. The peptides then either undergo re-uptake into the 
cell, diffuse through the cell wall into the medium, or become degraded within the 
periplasmic space. 

The test polypeptide library may be secreted into the periplasm by any of a number of 
exemplary mechanisms, depending on the nature of the expression system to which they are 
linked. In one embodiment, the peptide may be structurally linked to a yeast signal sequence, 
such as that present in the a-factor precursor, which directs secretion through the 
_ endoplasmic reticulum and-Golgi apparatus. Since this is the same route that the receptor 
protein follows in its journey to the plasma membrane, opportunity exists in cells expressing 
both the receptor and the peptide library for a specific peptide to interact with the receptor 
during transit through the secretory pathway. This has been postulated to occur in 
mammalian cells exhibiting autocrine activation. Such interaction could yield activation of 
the response pathway during transit, which would still allow identification of those cells 
expressing a peptide agonist. For situations in which peptide antagonists to externally applied 
receptor agonist are sought, this system would still be effective, since both the peptide 
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antagonist and receptor would be delivered to the outside of the cell in concert. Thus, those 
cells producing an antagonist would be selectable, since the peptide antagonist would be 
properly and timely situated to prevent the receptor from being stimulated by the externally 
applied agonist. 

An alternative mechanism for delivering peptides to the periplasmic space is to use 
the ATP-dependent transporters of the STE6/MDR1 class. This transport pathway and the 
signals that direct a protein or peptide to this pathway are not as well characterized as is the 
endoplasmic reticulum-based secretory pathway. Nonetheless, these transporters apparently 
can efficiently export certain peptides directly across the plasma membrane, without the 
peptides having to transit the ER/Golgi pathway. It is anticipated that at least a subset of 
peptides can be secreted through this pathway by expressing the library in context of the 
a-factor prosequence and terminal tetrapeptide. The possible advantage of this system is that 
the receptor and peptide da not come into contact until both are delivered to the external 
surface of the cell. Thus, this system strictly mimics the situation of an agonist or antagonist 
that is normally delivered from outside the cell. Use of either of the described pathways is 
within the scope of the invention. 

The present invention does not require periplasmic secretion, or, if such secretion is 
provided, any particular secretion signal or transport pathway. 

V. Cytokine Receptors 

In one embodiment the target receptor is a cytokine receptor. Cytokines are a family 
of soluble mediators of cell-to-cell communication that includes interleukins, interferons, and 
colony-stimulating factors. The characteristic features of cytokines lie in their functional 
redundancy and pleiotropy. Most of the cytokine receptors that constitute distinct 
superfamilies do not possess intrinsic protein tyrosine kinase domains, yet receptor 
stimulation usually invokes rapid tyrosine phosphorylation of intracellular proteins, including 
the receptors themselves. Many members of the cytokine receptor superfamily acitvate the 
Jak protein tyrosine kinase family, with resultant phosphorylation of the STAT 
transcriptional activator factors. IL-2, IL-7, IL-2 and Interferon 7 have all been shown to 
activate Jak kinases (Frank et al (1995) Proc Natl Acad Sci USA 92:7779-7783); Scharfe et 
al. (1995) Blood 86:2077-2085); (Bacon et al. (1995) Proc Natl Acad Sci USA 92:7307- 
731 1); and (Sakatsume et al (1995) I Biol Chem 270:17528-17534). Events downstream of 
Jak phosphorylation have also been elucidated. For example, exposure of T lymphocytes to 
IL-2 has been shown to lead to the phosphorylation of signal transducers and activators of 
transcription (STAT) proteins STAT la, STAT2P, and STAT3, as well as of two STAT- 
related proteins, p94 and p95. The STAT proteins were found to translocate to the nucleus 
and to bind to a specific DNA sequence, thus suggesting a mechanism by which IL-2 may 
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activate speicfic genes involved in immune cell function (Frank et al. supra). Jak3 is 
associated with the gamma chain of the IL-2, IL-4, and IL-7 cytokine receptors (Fujii et al. 
(1995) Proc Natl Acad Sci 92:5482-5486) and (Musso et al (1995) J Exp Med. 181:1425- 
1431). The Jak kinases have also been shown to be activated by numerous ligands that signal 
via cytokine receptors such as, growth hormone and erythropoietin and IL-6 (Kishimoto 
(1994) Stem cells Suppl 12:37-44). 

Detection signals which may be scored for in the present assay, in addition to direct 
detection of second messangers, such as by changes in phosphorylation, includes reporter 
constructs which include transcriptional regulatory elements responsive to the STAT 
proteins. Described infra. 

VI Multisubunit Immune Recognition Receptor (MIRR). 

In another embodiment the receptor is a multisubunit receptor. Receptors can be 
comprised of multiple proteins referred to as subunits, one category of which is referred to as 
a multisubunit receptor is a multisubunit immune recognition receptor (MIRR). MIRRs 
include receptors having multiple noncovalently associated subunits and are capable of 
interacting with src-family tyrosine kinases. MIRRs can include, but are not limited to, B 
cell antigen receptors, T cell antigen receptors, Fc receptors and CD22. One example of an 
MIRR is an antigen receptor on the surface of a B cell. To further illustrate, the MIRR on the 
surface of a B cell comprises membrane-bound immunoglobulin (mlg) associated with the 
subunits Ig-a and Ig-p or Ig-y, which forms a complex capable of regulating B cell function 
when bound by antigen. An antigen receptor can be functionally linked to an amplifier 
molecule in a manner such that the amplifier molecule is capable of regulating gene 
transcription. 

Src-family tyrosine kinases are enzymes capable of phosphorylating tyrosine residues 
of a target molecule. Typically, a src-family tyrosine kinase contains one or more binding 
domains and a kinase domain. A binding domain of a src-family -tyrosine kinase is capable of 
binding to a target molecule and a kinase domain is capable of phosphorylating a target 
molecule bound to the kinase. Members of! the src family of tyrosine kinases are 
characterized by an N-terminal unique region followed by three regions that contain different 
degrees of homology among all the members of the family. These three regions are referred 
to as src homology region 1 (SHI), src homology region 2 (SH2) and src homology region 3 
(SH3). Both the SH2 and SH3 domains are believed to have protein association functions 
important for the formation of signal transduction complexes. The amino acid sequence of an 
N-terminal unique region, varies between each src-family tyrosine kinase. An N-terminal 
unique region can be at least about the first 40 amino acid residues of the N-terminal of a src- 
family tyrosine kinase. 



Syk-family kinases are enzymes capable of phosphorylating tyrosine residues of a 
target molecule. Typically, a syk-family kinase contains one or more binding domains and a , 
kinase domain. A binding domain of a syk-family tyrosine kinase is capable of binding to a 
target molecule and a kinase domain is capable of phosphorylating a target molecule bound to 
the kinase. Members of the syk- family of tyrosine kinases are characterized by two SH2 
domains for protein association function and a tyrosine kinase domain. 

A primary target molecule is capable of further extending a signal transduction 
pathway by modifying a second messenger molecule. Primary target molecules can include, 
but are not limited to, phosphatidylinositol 3-kinase (PI-3K), P21 ras GAPase-activating 
protein and associated PI 90 and P62 protein, phospholipases such as PLCyl and PLCy2, 
MAP kinase, She and VAV. A primary target molecule is capable of producing second 
messenger molecule which is capable of further amplifying a transduced signal. Second 
messenger molecules . include, but are not limited to diacylglycerol and inositol 1,4,5- 
triphosphate (IP3). Second messenger molecules are capable of initiating physiological 
events which can lead to alterations in gene transcription. For example, production of IP3 
can result in release of intracellular calcium, which can then lead to activation of calmodulin 
kinase II, which can then lead to serine phosphorylation of a DNA binding protein referred to 
as ets-1 proto-onco-protein. Diacylglycerol is capable of activating the signal transduction 
protein, protein kinase C which affects the activity of the API DNA binding protein complex. 
Signal transduction pathways can lead to transcriptional activation of genes such as c-fos, 
egr-1, and c-myc. 

She can be thought of as an adaptor molecule. An adaptor molecule comprises a 
protein that enables two other proteins to form a complex (e.g., a three molecule complex). 
She protein enables a complex to form which includes Grb2 and SOS. She comprises an SH2 
domain that is capable of associating with the SH2 domain of Grb2. 

Molecules of a signal transduction pathway can associate with one another using 
recognition sequences. Recognition sequences enable specific binding between two 
molecules. Recognition sequences can vary depending upon the structure of the molecules 
that are associating with one another. A molecule can have one or more recognition 
sequences, and as such ean-associate with one or more different molecules. 

Signal transduction pathways for MIRR complexes are capable of regulating the 
biological functions of a cell. Such functions can include, but are not limited to the ability of 
a cell to grow, to differentiate and to secrete cellular products. MIRR-induced signal 
transduction pathways can regulate the biological functions of specific types of cells involved 
in particular responses by an animal, such as immune responses, inflammatory responses and 
allergic responses. Cells involved in an immune response can include, for example, B cells, 
T cells, macrophages, dendritic cells, natural killer cells and plasma cells. Cells involved in 



inflammatory responses can include, for example, basophils, mast cells, eosinophils, 
neutrophils and macrophages. Cells involved in allergic responses can include, for example 
mast cells, basophils, B cells, T cells and macrophages. 

In exemplary embodiments of the subject assay, the detection signal is a second 
messangers, such as a phosphorylated src-like protein, includes reporter constructs which 
include transcriptional regulatory elements such as serum response element (SRE), 12-0- 
tetradecanoyl-phorbol-13-acetate response element, cyclic AMP response element, c- fos 
promoter, or a CREB-responsive element. 

VII Nuclear Receptors. 

In another embodiment, the target receptor is a nuclear receptor. The nuclear 
receptors may be viewed as ligand-dependent transcription factors. These receptors provide a 
direct link between extracellular signals, mainly hormones, and transcriptional responses. 
Their transcriptional activation fuction is regulated by endogenous small molecules, such as 
steroid hormones, vitamin D, ecdysone, retinoic acids and thyroid hormones, which pass 
readily through the plasma membrane and bind their receptors inside the cell (Laudet and 
Adelmant (1995) Current Biology 5:124). The majority of these receptors appear to contain 
three domains: a variable amino terminal domain; a highly conserved, DNA-binding domain 
and a moderately conserved, carboxyl-terminal ligand-binding domain (Power et al. (1993) 
Curr. Opin. Cell Biol. 5:499-504). Examples include the estrogen, progesterone, androgen, 
thyroid hormone and mineralocorticoid receptors. In addition to the known steroid 
receptors, at least 40 orphan members of this superfamily have been identified. (Laudet et 
al., (1992) EMBO 1 11:1003-1013). There are at least four groups of orphan nuclear 
receptors represented by NGF1, FTZ-F1, Rev-erbs, and RARs, which are by evolutionary 
standards, only distantly related to each other (Laudet et al. supra). While the steroid 
hormone receptors bind exclusively as homodimers to a palindrome of their hormone 
responsive element other nuclear receptors bind as heterodimers. Interestingly, some orphan 
receptors bind as monomers to similar response elements and require for their function a 
specific motif that is rich in basic amino-acid residues and is located corboxy-terminal to the 
"DNA-binding domain (Laudet and Adelmant supra.) 

In preferred embodiments, the subject assay is derived to utilize a hormone-dependent 
reporter construct for selection. For instance, glucocorticoid response elements (GREs) and 
thyroid receptor enhancer-like DNA sequences (TREs) can be used to drive expression of 
reporter construct in response to hormone binding to hormone receptors. GRE's are enhancer- 
like DNA sequences that confer glucocorticoid responsiveness via interaction with the 
glucocorticoid receptor. See Payvar, et al. (1983) Cell 35:381 and Schiedereit et al. (1983) 
Nature 304:749. TRE's are similar to GRE's except that they confer thyroid hormone 
responsiveness via interaction with thyroid hormone receptor. Turning now to the interaction 
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of hormones and receptors, it is known that a steroid or thyroid hormone enters cells by 
facilitated diffusion and binds to its specific receptor protein, initiating an allosteric alteration 
of the protein. As a result of this alteration, the hormone/receptor complex is capable of 
binding to certain specific sites on transcriptional regulatory sequence with high affinity. 

It is also known that many of the primary effects of steroid and thyroid hormones 
involve increased transcription of a subset of genes in specific cell types. Moreover, there is 
evidence that activation of transcription (and, consequently, increased expression) of genes 
which are responsive to steroid and thyroid hormones (through interaction of chromatin with 
hormone receptor/hormone complex) is effected through binding of the complex to enhancers 
associated with the genes. 

In any case, a number of steroid hormone and thyroid hormone responsive 
transcriptional control units, some of which have been shown to include enhancers, have been 
identified. These include the mouse mammary tumor virus 5-long terminal repeat (MMTV 
LTR), responsive to glucocorticoid, aldosterone and androgen hormones; the transcriptional 
control units for mammalian growth hormone genes, responsive to glucocorticoids, estrogens, 
and thyroid hormones;, the transcriptional control units for mammalian prolactin genes and 
progesterone receptor genes, responsive to estrogens; the transcriptional control' units for 
avian ovalbumin genes, responsive to progesterones; mammalian metallothionein gene 
transcriptional control units, responsive to glucocorticoids; and mammalian hepatic alpha 2u 
-globulin gene transcriptional control units, responsive to androgens, estrogens, thyroid 
hormones and glucocorticoids. Such steroid hormone and thyroid hormone responsive 
transcriptional control units can be used to generate reporter constructs which are sensitive to 
agonists and antagonists of the steroid hormone and/or thyroid hormone receptors. See, for 
example, U.S. Patents 5,298,429 and 5,071,773, both to Evans, et. al. Moreover, the art 
describes the functional expression of such receptors in yeast. See also for example, Caplan 
et al: (1995) J Biol Chem 270:5251-7; and Baniahmad et al. (1995) Mol Endocrinol 9: 34-43. 

VIII. Receptor tyrosine kinases . 

In still another embodiment, the target receptor is a receptor tyrosine kinase. The 
receptor tyrosine kinases can be divided into five subgroups on the basis of structural 
similarities in their extracellular domains and the organization of the tyrosine kinase catalytic 
region in their cytoplasmic domains. Sub-groups I (epidermal growth factor (EGF) receptor- 
like), II (insulin receptor-like) and the eph/eck family contain cysteine-rich sequences (Hirai 
et al, (1987) Science 238:1717-1720 and Lindberg and. Hunter, (1990) Mol Cell Biol 
10:6316-6324). The functional domains of the kinase region of these three classes of 
receptor tyrosine kinases are encoded as a contiguous sequence ( Hanks ef al. (1988) Science 
241:42-52). Subgroups III (platelet-derived growth factor (TDGF) receptor-like) and IV fthe 



fibroblast growth factor (FGF) receptors) are characterized as having immunoglobulin (Ig)- 
like folds in their extracellular domains, as well as having their kinase domains divided in 
two parts by a variable stretch of unrelated* amino acids (Yanden and Ullrich (1988) supra 
and Hanks et al. (1988) supra). 

The family with by far the largest number of known members is the EPH family. 
Since the description of the prototype, the EPH receptor (Hirai et al. .(1987) Science 
238:1717-1720), sequences have been reported for. at least ten members of this family, not 
counting apparently orthologous receptors found in more than one species. Additional partial 
sequences, and the rate at which new members are still being reported, suggest the family is 
even larger (Maisonpierre et al. (1993) Oncogene 8:3277-3288; Andres et al. (1994) 
Oncogene 9:1461-1467; Henkemeyer et al. (1994) Oncogene 9:1001-1014; Ruiz et al. (1994) 
Mech Dev 46:87-100; Xu et al. (1994) Development 120:287-299; Zhou et al. (1994) J 
Neurosci Res 37:129-143; and references in Tuzi and Gullick (1994) Br J Cancer 69:417- 
421). Remarkably, despite the large number of members in the EPH family, all of these 
molecules were identified as orphan receptors without known ligands. 

The expression patterns determined for some of the EPH family receptors have 
implied important roles for these molecules in early vertebrate development. In particular, 
the timing and pattern of expression of sek, mek4 and some of the other receptors during the 
phase of gastrulation and early organogenesis has suggested functions for these receptors in 
the important cellular interactions involved in patterning the embryo at this stage (Gilardi- 
Hebenstreit et al. (1992) Oncogene 7:2499-2506; Nieto et al, (1992) Development 116:1137- 
1 150; Henkemeyer et al., supra; Ruiz et al., supra; and Xu et al., supra). Sek, for example, 
shows a notable early expression in the two areas of the mouse embryo that show obvious 
segmentation, namely the somites in the mesoderm and the rhombomeres of the hindbrain; 
hence the name sek, for segmentally expressed kinase (Gilardi-Hebenstreit et al., supra; Nieto 
et al., supra). As in Drosophila, these segmental structures of the mammalian embryo are 
implicated as important elements in establishing the body plan. The observation that Sek 
expression precedes the appearance of morphological segmentation suggests a role for sek in 
forming these segmental structures, or in determining segment-specific cell properties such as 
lineage compartmentation (Nieto et al., supra). Moreover, EPH receptors have been 
implicated, by their pattern of expression; in the development and maintenance of nearly 
every tissue in the embryonic and adult body. For instance, EPH receptors have been 
detected throughout the nervous system, the testes, the cartilaginous model of the skeleton, 
tooth primordia, the infundibular component of the pituitary, various epithelia tissues, lung, 
pancreas, liver and kidney tissues. Observations such as this have been indicative of 
important and unique roles for EPH family kinases in development and physiology, but 
further progress in understanding their action has been severely limited by the lack of 
information on their ligands. 
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As used herein, the terms "EPH receptor" or "EPH-type receptor" refer to a class of 
receptor tyrosine kinases, comprising at least eleven paralogous genes, though many more 
orthologs exist within this class, e.g. homologs from different species. EPH receptors, in 
general, are a discrete group of receptors related by homology and easily reconizable, e.g., 
they are typically characterized by an extracellular domain containing a characteristic spacing 
of cysteine residues near the N-terminus and two fibronectin type III repeats (Hirai et al. 
{mi) Science 238:1717-1720; Lindberg et al. (1990) Mol Cell Biol 10:6316-6324; Chan et 
al. (1991) Oncogene 6:1057-1061; Maisonpierre et al. (1993) Oncogene 8:3277-3288; 
Andres et al. (1994) Oncogene 9:1461-1467; Henkemeyer et al. 094) Oncogene 9:1001- 
1014; Ruiz et al. (1994) Mech Dev 46:87-100; Xu et al. (1994) Development 120:287-299; 
Zhou et al. (1994) J Neurosci Res 37:129-143; and references in Tuzi and Gullick (1994) Br 
J Cancer 69:417-421). Exemplary EPH receptors include the eph, elk, eck, sek, mek4, hek, 
hek2, eek, erk, tyrol, tyro4, tyro5, tyro6, tyroll, cek4, cek5, cek6, cek7, cek8, cek9, ceklO, 
bsk, rtkl,rtk2, rtk3, mykl, myk2, ehkl, ehk2,pagliaccio, htk, erk and nuk receptors. The term 
"EPH receptor" refers to the membrane form of the receptor protein, as well as soluble 
extracellular fragments which retain the ability to bind the ligand of the present invention. 

In exemplary embodiments, the detection signal is provided by^ detecting 
phosphorylation of intracellular proteins, e.g., MEKKs, MEKs, or Map kinases, or by the use 
of reporter constructs which include transcriptional regulatory elements responsive to c-fos 
and/or c-jun. Described infra. 

IX. G Protein-Couvled Receptors. 

One family of signal transduction cascades found in eukaryotic cells utilizes 
heterotrimeric "G proteins." Many different G proteins are known to interact with receptors. 
G protein signaling systems include three components: the receptor itself, a GTP-binding 
, protein (G protein), and an intracellular target protein. 

The cell membrane acts as a switchboard. Messages arriving through different 
receptors can produce a single effect if the receptors act on the same type of G protein. On 
the other hand, signals activating a single receptor can produce more than one effect if the 
receptor acts on different kinds of G proteins, or if the G proteins can act on different 
effectors. 

In their resting state, the G proteins, which consist of alpha (a), beta (P) and gamma 
(y) subunits, are complexed with the nucleotide guanosine diphosphate (GDP) and are in 
contact with receptors. When a hormone or other first messenger binds to receptor, the 
receptor changes conformation and this alters its interaction with the G protein. This spurs 
the a subunit to release GDP, and the more abundant nucleotide guanosine triphosphate 
(GTP), replaces it, activating the G protein. The G protein then dissociates to separate the a 
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subunit from the still complexed beta and gamma subunits. Either the Ga subunit, or the GPy 
complex, depending on the pathway, interacts with an effector. The effector (which is often 
an enzyme) in turn converts an inactive precursor molecule into an active "second 
messenger," which may diffuse through the cytoplasm, triggering a metabolic cascade. After 
a few seconds, the Ga converts the GTP to GDP, thereby inactivating itself. The inactivated 
Ga may then reassociate with the GPy complex. 

Hundreds, if not thousands, of receptors convey messages through heterotrimeric G 
proteins, of which at least 17 distinct forms have been isolated. Although the greatest 
variability has been seen in the a subunit, several different p and y structures have been 
reported. There are, additionally, several different G protein-dependent effectors. 

Most G protein-coupled receptors are comprised of a single protein chain that is 
threaded through the plasma membrane seven times. Such receptors are often referred to as 
seven-transmembrane receptors (STRs). More than a hundred different STRs have been 
found, including many distinct receptors that bind the same ligand, and there are likely many 
more STRs awaiting discovery. 

In addition, STRs have been identified for which the natural ligands are unknown; 
these receptors are termed "orphan" G protein-coupled receptors, as described above. 
Examples include receptors cloned by Neote et al. (1993) Cell 72, 415; Kouba et al. FEBS 
Lett. (1993) 321, 173; Birkenbach etal.(1993) J. Virol. 67,2209. 

The "exogenous receptors" of the present invention may be any G protein-coupled 
receptor which is exogenous to the cell which is to be genetically engineered for the purpose 
of the present invention. This receptor may be a plant or animal cell receptor. Screening for 
binding to plant cell receptors may be useful in the development of, e:g., herbicides. In the 
case of an animal receptor, it may be of invertebrate or vertebrate origin. If an invertebrate 
receptor, an insect receptor is preferred, and would facilitate development of insecticides. The 
receptor may also be a vertebrate, more preferably^ a mammalian, still more preferably a 
human, receptor. The exogenous receptor is also preferably a seven transmembrane segment 
receptor. 

Known ligands for G protein coupled receptors include: purines and nucleotides, such 
as adenosine, cAMP, ATP, UTP, ADP, melatonin' and the like; biogenic amines (and related 
natural ligands), such as 5-hydroxytryptarnine, acetylcholine, dopamine, adrenaline, 
adrenaline, adrenaline., histamine, noradrenaline, noradrenaline, noradrenaline., 
tyramine/octopamine and other related compounds; peptides such as adrenocorticotrophic 
hormone (acth), melanocyte stimulating hormone (msh), melanocortins, neurotensin (nt), 
bombesin and related peptides, endothelins, cholecystokinin, gastrin, neurokinin b (nk3), 
invertebrate tachykinin-like peptides, substance k (nk2), substance p (nkl), neuropeptide y 
(npy), thyrotropin releasing-factor (trf), bradykinin, angiotensin ii, beta-endorphin, c5a 
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anaphalatoxin, calcitonin, chemokines (also called intercrines), corticotrophic releasing factor 
(erf), dynorphin, endorphin, finlp and other formylated peptides, follitropin (fsh), fungal 
mating pheremones, galanin, gastric inhibitory polypeptide receptor (gip), glucagon-like 
peptides (glps), glucagon, gonadotropin releasing hormone (gnrh), growth hormone releasing 
hormone(ghrh), iasect diuretic hormone, interleukin-8, leutropin (lh/hcg), met-enkephalin, 
opioid peptides, oxytocin, parathyroid hormone (pth) and pthrp, pituitary adenylyl cyclase 
activiating peptide (pacap), secretin, somatostatin, thrombin, thyrotropin (tsh), vasoactive 
intestinal peptide (vip), vasopressin, vasotocin; eicosanoids such as ip-prostacyclin, pg- 
prostaglandins, tx-thromboxanes; retinal based compounds such as vertebrate 11-cis retinal, 
invertebrate 11-cis retinal and other related compounds; lipids and lipid-based compounds 
such as cannabinoids, anandamide, lysophosphatidic acid, platelet activating factor, 
leukotrienes and the like; excitatory amino acids and ions such as calcium ions and 
glutamate. 

Suitable examples of G-protein coupled receptors include, but are not limited to, 
dopaminergic, muscarinic cholinergic, a-adrenergic, b-adrenergic, opioid (including delta and 
mu), cannabinoid, serotoninergic, and GABAergic receptors. Preferred receptors include the 
5HT family of receptors, dopamine receptors,C5a receptor and FPRL-1 receptor, cyclo- 
histidyl-proline-diketoplperazine receptors, melanocyte stimulating hormone release 
inhibiting factor receptor, and receptors for neurotensin, thyrotropin releasing hormone, 
calcitonin, cholecytokinin-A, neurokinin-2, histamine-3, cannabinoid, melanocortin, or 
adrenomodulin, neuropeptide- Yl or galanin. Other suitable receptors are listed in the art. The 
term "receptor," as used herein, encompasses both naturally occurring and mutant receptors. 

Many of these G protein-coupled receptors, like the yeast a- and a-factor receptors, 
contain seven hydrophobic amino acid-rich regions which are assumed to lie within the 
plasma membrane. Specific human G protein-coupled STRs for which genes have been 
isolated and for which expression vectors could be constructed include those listed herein arid 
others known in the art. Thus, the gene would be operably linked to a promoter functional in 
the cell to be engineered and to a signal sequence that also functions in the cell. For example 
in the case of yeast, suitable promoters include Ste2, Ste3 and gal 10 . Suitable signal 
sequences include those of Ste2, Ste3 and of other genes which encode proteins secreted by 
yeast cells. Preferably, when a yeast cell is used, the codons of the gene would be optimized 
for expression in yeastr-See Hoekeina-et al.,(1987) Mol Cell. Biol, 7:2914-24; Sharp, et al., 
(1986)14:5125-43. 

The homology of STRs is discussed in Dohlman et al., Ann. Rev. Biochem., (1991) . 
60:653-88. When STRs are compared, a distinct spatial pattern of homology is discernible. 
The transmembrane domains are often the most similar, whereas the N- and C-terminal 
regions, and the cytoplasmic loop connecting transmembrane segments V and VI are more 



divergent. 

The functional significance of different STR regions has been studied by introducing 
point mutations (both substitutions and deletions) and by constructing chimeras of different 
but related STRs. Synthetic peptides corresponding to individual segments have also been 
tested for activity. Affinity labeling has been used to identify ligand binding sites. 

It is conceivable that a foreign receptor which is expressed in yeast will functionally 
integrate into the yeast membrane, and there interact with the endogenous yeast G protein. 
More likely, either the receptor will need to be modified (e.g., by replacing its V-VI loop 
with that of the yeast STE2 or STE3 receptor), or a compatible G protein should be provided. 

If the wild-type exogenous G protein-coupled receptor cannot be made functional in 
yeast, it may be mutated for this purpose. A comparison would be made of the amino acid 
sequences of the exogenous receptor and. of the yeast receptors, and regions of high and low 
homology identified. Trial mutations would then be made to distinguish regions involved in 
ligand or G protein binding, from those necessary for functional integration in the membrane. 
The exogenous receptor would then be mutated in the latter region to more closely resemble 
the yeast receptor, until functional integration was achieved. If this were insufficient to 
achieve functionality, mutations would next be made in the regions involved in G protein 
binding. Mutations would be made in regions involved in ligand binding only as a last resort, 
and then an effort would be made to preserve ligand binding by making conservative 
substitutions whenever possible. 

Preferably, the yeast genome is modified so that it is unable to produce the yeast 
receptors which are homologous to the exogenous receptors in functional form. Otherwise, a 
positive assay score might reflect the ability of a peptide to activate the endogenous G 
. protein-coupled receptor, and not the receptor of interest. 

A. Chemoattractant receptors 

The N-formyl peptide receptor is a classic example of a calcium mobilizing G 
protein-coupled receptor expressed by neutrophils and other phagocytic cells of the 
^mammalian immune system (Snyderman et al. (1988) In Inflammation: Basic Principles and 
Clinical Correlates, pp. 309-323). N-formyl peptides of bacterial origin bind to the receptor 
and engage a complex activation program that results in directed cell movement, release of 
inflammatory granule contents, and activation of a latent NADPH oxidase which is important 
for the production of metabolites of molecular oxygen. This pathway initiated by receptor- 
ligand interaction is critical in host protection from pyogenic infections. 1 Similar signal 
transduction occurs in response to the inflammatpry peptides C5a and IL-8. 

Two other formyl peptide receptor like (FPRL) genes have been cloned based on their 



ability to hybridize to a fragment of the NFPR cDNA coding sequence. These have been 
named FPRL1 (Murphy et al. (1992) J. Biol Chen 267:7637-7643) and FPRL2 (Ye et al. 
(1992) Biochem Biophys Res. Comm. 184:582-589). FPRL2 was found to mediate calcium 
mobilization in mouse fibroblasts transfected with the gene and exposed to formyl peptide. 
In contrast, although FPRL1 was found to be 69% identical in amino acid sequence to NFPR, 
it did not bind prototype N-formyl peptides ligands when expressed in heterologous cell 
types. This lead to the hypothesis of the existence of an as yet unidentified ligand for the 
FPRL1 orphan receptor (Murphy et al. supra). ' 

Using the. technology described herein a ligand has been cloned for these orphan 
receptors. 

B. G proteins ' , 

In the case of an exogenous G-protein coupled receptor, the yeast cell must be able to 
produce a G protein which is activated by the exogenous receptor, and which can in turn 
activate the yeast effector(s). The art suggests that the endogenous yeast Get subunit (e.g., 
GPA) will be often be sufficiently homologous to the "cognate" Get subunit which is natively 
associated with the exogenous receptor for coupling to occur. More likely, it will be 
necessary to genetically engineer the yeast cell to produce a foreign Get subunit which can 
properly interact with the exogenous receptor. For example, the Get subunit of the yeast G 
protein may be replaced by the Got subunit natively associated with the exogenous receptor. 

Dietzel and Kurjan, (1987) Cell, 50:1001) demonstrated that rat Gas functionally 
coupled to the yeast Gp> complex. However, rat Gcti2 complemented only when 
substantially overexpressed, while GotO did not complement at all. Kang, et al., Mol. Cell. 
Biol, (1990)10:2582). Consequently, with some foreign Got subunits, it is not feasible to 
simply replace the yeast Got. 

If the exogenous G protein coupled receptor is not adequately coupled to yeast GPy 
by the Got subunit natively associated with the receptor, the Got subunit may be modified to 
improve coupling. These modifications often will take the form of mutations which increase 
the resemblance of the Got subunit to the yeast. Got while decreasing its resemblance to the 
receptor-associated Got. For example, a residue may be changed so as to become identical to 
the corresponding yeast Got residue, or to at least belong to the same exchange grbup of that 
residue. After modification, the modified Got subunit might or might not be "substantially 
homologous" to the foreign and/or the yeast Get subunit. 

The modifications are preferably concentrated in regions of the Got which are likely 
to be involved in Gpy binding. In some embodiments, the modifications will take the form of 
replacing one or more segments of the receptor-associated Got with the corresponding yeast 
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Gct segment(s), thereby forming a chimeric Get subunit. (For the purpose of the appended 
claims, the term "segment" refers to three or more consecutive amino acids.) In other 
embodiments, point mutations may be sufficient. 

This chimeric Get subunit will interact with the exogenous receptor and the yeast GPy 
complex, thereby permitting signal transduction. While use . of the endogenous yeast GPy is 
preferred, if a foreign or chimeric Gpy is capable of transducing the signal to the yeast 
effector, it may be used instead. 

C. Got Structure 

Some aspects of Got structure are relevant to the design of modified Ga subunits. 
The amino terminal 66 residues of GPA1 are aligned with the cognate domains of human 
Gas, Gcti2, Goti3, Gal 6 and transducin. In the GPA41Ga hybrids, the amino terminal 41 
residues (derived from GPA1) are identical, end with the sequence-LEKQRDKNE- and are 
underlined for emphasis. All residues following the glutamate (E) residue at positional are 
contributed by the human Ga subunits, including the consensus nucleotide binding motif 
-GxGxxG-. Periods in the sequences indicate gaps that have been introduced to maximize 
alignments in this region. Codon bias is mammalian. For alignments of the entire coding 
regions of GPA1 with Gas, Gai, and GaO, Gaq and Gaz, see Dietzel and Kurjan (1987, 
Cell 50:573) and Lambright, et al. (1994, Nature 369:621-628). Additional sequence 
information is provided by Mattera, et al. (1986, FEES Lett 206:36-41), Bray, et al. (1986, 
Proc. Natl Acad. Sci USA 83:8893-8897) and Bray, et al. (1987, Proc Natl Acad Sci USA 
84:5115-5119). 

The gene encoding a G protein homolog of S. cerevisiae was cloned independently by 
Dietzel and Kurjan {supra) (SCG1) and by Nakafiiku, et al. (1987 Proc Natl Acad Sci 
84:2140-2144) (GPA1). Sequence analysis revealed a high degree of homology between the 
protein encoded by this gene and mammalian Ga. GPA1 encodes a protein of 472 amino 
acids, as compared with approximately 340-350 a. a. for most mammalian Ga subunits in 
four described families, Gas, Gai, Gaq and Gal2/1 3. Nevertheless, GPA1 shares overall 
sequence and structural homology with all Ga proteins identified to date. The highest overall 
homology in GPA1 is to the Gai family (48% identity, or 65% with conservative 
substitutions) and the lowest is to GQS (33% identity, or 51% with conservativesubstitutions) 
(Nakafuku, et al., supra). 

The regions of high sequence homology among Ga subunits are dispersed throughout 
their primary sequences, with the regions sharing the highest degree of homology mapping to 
sequence that comprises the guanine nucleotide binding/GTPase domain. This domain is 
structurally similar to the ap fold of ras proteins and the protein synthesis elongation factor 
EF-Tu. This highly conserved guanine nucleotide-binding domain consists of a six-stranded 



P sheet surrounded by a set of five a-helices. It is within these p sheets and a helices that the 
highest degree of conservation is observed among all Got proteins, including GPA1. The least 
sequence and structural homology is found in the intervening loops between the P sheets and 
a helices that define the core GTPase domain. There are a total of four "intervening loops" or 
"inserts" present in all Got subunits. In the crystal structures reported to date for the GDP- 
and GTPyS-liganded forms of bovine rod transducin (Noel, et al. (1993) Nature 366:654- 
663); (Lambright, et al. (1994) Nature 369:621-628), the loop residues are found to be 
outside the core GTPase structure. Functional roles for these loop structures have been 
established in only a few instances. A direct role in coupling to phosphodiesterase-? has been 
demonstrated for residues within inserts 3 and 4 of Gat (Rarick, et al. (1992) Science 
256:1031-1033); (Artemyev, et al. (1992) J. Biol Chem. 267:25067-25072), while a 
"GAP-like" activity has been ascribed to the largely a-helieal insert 1 domain of GaS 
(Markby, et al. (1993) Science 262:1805-1901). 

While the amino- and carboxy-termini of Ga subunits do not share striking homology 
either at the primary, secondary, or tertiary levels, there are several generalizations that can 
be made about them. First, the amino termini of Ga subunits have been implicated in the 
association of Ga with GPy complexes and in membrane association via N-terminal 
myristoylation. In addition, the carboxy-termini have been implicated in the association of 
Gapy heterotrimeric complexes with G protein-coupled receptors (Sullivan, et al. (1987) 
Nature 330:758-760); West, et al. (1985) J. Biol Chem, 260:14428-14430); (Conklin, et al. 
(\993)Nature 363:274-276). Data in support of these generalizations about the function of 
the N-terminus derive from several sources, including both biochemical and genetic studies. 

As indicated above, there is little if any sequence homology shared among the amino 
termini of Ga subunits. The amino terminal domains of Ga subunits that precede the first 
P-sheet (containing the sequence motif -LLLLGAGESG-; see Noel, et al. (supra) for the 
numbering of the structural elements of Ga subunits) vary in length from 41 amino acids 
(GPA1) to 31 amino acids (Gat). Most Ga subunits share the consensus sequence for the 
addition of myristic acid at their amino termini (MGXaaS-), although not all Ga subunits that 
contain this motif have myristic acid covalently associated with the glycine at position 2 
(Speigel, et al. (1991) TIBS 16:338-3441). The role of this post-translational modification has 
been inferred from studies in which the activity of mutant Ga subunits from which the 

consensus sequence for myristoylation has been added or deleted has been assayed (Mumby 
et al. (1990) Proc. Natl Acad. Sci. USA 87: 728-732; (Under, et al. (1991) J. Biol Chem, 
266:4654-4659); Gallego, et al. (1992) Proc. Natl Acad. Sci. USA 89:9695-9699). These 
studies suggest two roles for N-terminal myristoylation. First, the presence of amino-terminal 
myristic acid has in some cases been shown to be required for association of Ga subunits 
with the membrane, and second, this modification has been demonstrated to play a role in 
modulating the association of Ga subunits with Gpy complexes. The role of myristoylation 



of the GPA1 gene products, at present, unknown. 

In other biochemical studies aimed at examining the role of the ammo-terminus of Ga 
in driving the association between Ga and GPy subunits, proteolytically or genetically 
truncated versions of Ga subunits were assayed for their ability to associate with 
GPycomplexes, bind guanine nucleotides and/or to activate effector molecules. In all cases, 
Ga subunits with truncated amino termini were deficient in all three functions (Graf, et al. 
(1992) J. Biol Chem. 267:24307-24314); (Journot, et al. (1990) J. Biol. Chem. 265:9009- 
9015); and (Neer, et al. (1988) J. Biol Chem 263:8996-9000). Slepak, et al. (1993, J. Biol 
Chem. 268:1414-1423) reported a mutational analysis of the N-terminal 56 a.a. of 
mammalian Gao expressed in Escherichia coli. Molecules with an apparent reduced ability 
to interact with exogenously added mammalian GPy were identified in, the mutant library. As 
the authors pointed out, however, the assay used to screen the mutants the extent of 
ADP-ribosylation of the mutant Ga by pertussis toxin was not a completely satisfactory 
probe of interactions between Ga and GPy. Mutations identified as inhibiting the interaction 
of the subunits, using this assay, may still permit the complexing of Ga and Gpy while 
sterically hindering the ribosylation of Ga by toxin. Genetic studies examined the role of 
amino-terminal determinants of Ga in heterotrimer subunit association, have been carried out 
in both yeast systems using GPA1 -mammalian Ga hybrids (Kang, et al. (1990) Mol Cell 
Biol 10:2582-2590) and in mammalian systems using Gai/Gas hybrids (Russell and 
Johnson (1993) Mol Pharmacol 44:255-263). In the former studies, gene fusions^ composed 
of yeast GPA1 and mammalian Ga sequences were constructed by Kang, et al. (supra) and 
assayed for their ability to complement a gpal null phenotype (i.e., constitutive activation of 
the pheromone response pathway) in S. cerevisiae. Kang, et al. demonstrated that wild type 
mammalian Gas, Gai but not Gao proteins are competent to associate with yeast Ga and 
suppress the gpal null phenotype, but only when overexpressed. Fusion proteins containing 
the amino-terminal 330 residues of GPA1 sequence linked to 160, 143, or 142 residues of the 
mammalian Gas, Gai and Gao carboxyl-terminal regions, respectively, also coupled to the 
yeast mating response pathway when overexpressed on high copy plasmids with strong 
inducible (CUP) or constitutive (PGK) promoters. All three of these hybrid molecules were 
able to complement the gpal null mutation in a growth arrest assay, and were additionally 
able to inhibit afactor responsiveness and mating in tester strains. These last two 
observations argue that hybrid yeast-mammalian Ga subunits are capable of interacting 
directly with yeast Gpy, thereby, disrupting the normal function of the yeast heterotrimer. 
Fusions containing the amino terminal domain of Gas, Gai pr Gao, however, did not 
complement the gpal null phenotype, indicating a requirement for determinants in the amino 
terminal 330 amino acid residues of GPA1 for association and sequestration of yeast Gpy 
complexes. Taken together, these data suggest that determinants in the amino terminal region 
of Ga subunits determine not only the ability to associate with GPy subunits in general, but 
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also with specific Gpy subunits in a species-restricted manner. 

Hybrid Gai/Gas subunits have been assayed in mammalian expression systems 
(Russell and Johnson {supra). In these studies, a large number of chimeric Ga subunits were 
assayed for an ability to activate adenylyl cyclase, and therefore, indirectly, for an ability to 
interact with Gpy(i.e., coupling of Ga to GPy= inactive cyclase; uncoupling of Ga from GPy 
= active cyclase). From these studies a complex picture emerged in which determinants in the 
region between residues 25 and 96 of the hybrids were found to determine the state of 
activation of these alleles as reflected in their rates of guanine nucleotide exchange and GTP 
hydrolysis and the extent to which they activated adenylyl cyclase in vivo. These data could 
be interpreted to support the hypothesis that structural elements in the region between the 
amino terminal methionine and the -1 sheet identified in the crystal structure of Gat (see 
Noel, et al. supra and Lambright, et al. supra) are involved in determining the state of 
activity of the heterotrimer by (1) driving association/dissociation between Gaand Gpy 
subunits; (2) driving GDP/GTP exchange. While there is no direct evidence provided by 
these studies to support the idea that residues in this region of Ga and residues in Gpy 
subunits contact one another, the data nonetheless provide a positive indication for the 
construction of hybrid Ga subunits that retain function. There is, however, a negative 
indicator that derives from this work in that some hybrid constructs resulted in constitutive 
activation of the chimeric proteins (i.e., a loss of receptor-dependent stimulation of GPy r 
. dissociation and effector activation). 

D. Construction of chimeric Go. subunits. 

In designing Ga subunits capable of transmitting, in yeast, signals originating at 
mammalian G protein-coupled, receptors, two general desiderata were recognized. First, the 
subunits should retain as much of the sequence of the native mammalian proteins as possible. 
Second, the level of expression for the heterologous components should approach, as closely 
as possible, the level of their endogenous counterparts. The results described by King, et al. 
(1990, Science 250: 12U123) for expression of the human p2-adrenergic receptor and Gas in 
yeast, taken together with negative results obtained by Kang, et al. (supra) with full-length 
mammalian Got subunits other than Gas, led us to the following preferences for the 

development of yeast strains in which mammalian G protein-coupled receptors could be 
linked to the pheromone response pathway. 

1. Mammalian Ga subunits will be expressed using the native sequence of each 
subunit or, alternatively, as minimal gene fusions with sequences from the amino- terminus 
of GPA1 replacing the homologous residues from the mammalian Ga subunits. 

2. Mammalian Ga subunits will be expressed from the GPA1 promotor either on low 
copy plasmids or after integration into the yeast genome as a single copy gene. 



3. Endogenous Gpy subunits will be provided by the yeast STE4 and STE18 loci. 

E. Site-Directed Mutagenesis versus Random Mutagenesis 

There are two general approaches to solving structure-function problems of the sort 
presented by attempts to define the determinants involved in mediating the association of the 
subunits that comprise the G protein heterotrimer. The first approach, discussed above with 
respect to hybrid constructs, is a rational one in which specific mutations or alterations are 
introduced into a molecule based upon the available experimental evidence. In a second 
approach, random mutagenesis techniques, coupled with selection or screening systems, are 
used to introduce large numbers of mutations into a molecule, and that collection of 
randomly mutated molecules is then subjected to a selection for the desired phenotype or a 
. screen in which the desired phenotype can be observed against a background of undesirable 
phenotypes. With random mutagenesis one can mutagenize an entire molecule or one can 
proceed by cassette mutagenesis. In the former instance, the entire coding region of a 
molecule is mutagenized by one of several methods (chemical, PCR, doped oligonucleotide 
synthesis) and that collection of randomly mutated molecules is subjected to selection or 
screening procedures. Random mutagenesis can be applied in this way in cases where the 
molecule being studied is relatively small and there are powerful and stringent selections or 
screens available to discriminate between the different classes of mutant phenotypes that will 
inevitably arise. In the second approach, discrete regions of a protein, corresponding either to 
defined structural (i.e. a-helices, p -sheets, turns, surface loops) or functional determinants 
(e.g., catalytic clefts," binding determinants, transmembrane segments) are subjected to 
saturating or semi-random mutagenesis and these mutagenized cassettes are re-introduced 
into the context of the otherwise wild type allele. Cassette mutagenesis is most useful when 
there is experimental evidence available to suggest a particular function for a region of a 
molecule and there is a powerful selection and/or screening approach available to 
discriminate between interesting and uninteresting mutants. Cassette mutagenesis is also 
useful when the parent molecule is comparatively large and the desire is to map the 
functional domains of a molecule by mutagenizing the molecule in a step-wise fashion, i.e. 
^mutating one linear cassette of residues at a time and then assaying for function. 

The present invention contemplates applying random mutagenesis in order to further 
delineate the determinants involved in Ga-Gf}y association. Random mutagenesis may be 
accomplished by many means, including: 

1 . PCR mutagenesis, in which the error prone Taq polymerase is exploited to generate 
mutant alleles of Get subunits, which are assayed directly in yeast for an ability to couple to 
yeast Gpy. 

2. Chemical mutagenesis, in which expression cassettes encoding Got subunits are 
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exposed to mutagens and the protein products of the mutant sequences are assayed directly in 
yeast for an ability to couple to yeast G py. 

3. Doped synthesis of oligonucleotides encoding portions of the Get gene. 

4. In vivo mutagenesis, in which random mutations are introduced into the coding 
region of Got subunits by passage through a mutator strain of E. coli, XL 1 -Red (mutD5 mutS 
mutT) (Stratagene, Menasa, WI). 

The random mutagenesis may be focused on regions suspected to be involved in 
Ga-GPy association as discussed in the next section. Random mutagenesis approaches are 
feasible for two reasons. First, in yeast one has the ability to construct stringent screens and 
facile selections (growth vs. death, transcription vs. lack of transcription) that are not readily 
available in mammalian systems. Second, when using yeast it is possible to screen efficiently 
through thousands of transformants rapidly. Cassette mutagenesis is immediately suggested 
by the observation (see infra) that the GPA41 hybrids couple to the pheromone response 
pathway. This relatively small region of Get subunits represents a reasonable target for this 
type of mutagenesis. Another region that may be amenable to cassette mutagenesis is that 
defining the surface of the switch region of Got subunits that is solvent-exposed in the crystal 
structures of Goti and transducin. From the data described below, this surface may contain 
residues that are in direct contact with yeast GPy subunits, and may therefore be a reasonable 
target for mutagenesis. 

F. Rational Design of Chimeric Ga Subunits 

Several classes of rationally designed GPA1 -mammalian Ga hybrid subunits have 
been tested for the ability to couple to yeast Py. The first, and largest, class of hybrids are 
those that encode different lengths of the GPA1 amino terminal domain in place of the 
homologous regions of the mammalian Ga subunits. This class of hybrid molecules includes 
GPAgAMHl > GPA41, GPAjd, and GPAlv/ hybrids, described below. The rationale for 
constructing these hybrid Ga proteins is based on results, described above, that bear on the 
importance of the amino terminal residues of Ga in mediating interaction with GPy. 

Preferably, the yeast Ga subunit is replaced by a chimeric Ga subunit in which a 
portion, e.g., at least about 20, more preferably at least about 40, amino acids, which is 
substantially homologous with the corresponding residues of the amino terminus of the yeast 
Ga, is fused to a sequence substantially homologous with the main body of a mammalian (or 
other exogenous) Ga. While 40 amino acids is the suggested starting point, shorter or longer 
portions may be tested to determine the minimum length required for coupling to yeast GPy 
and the maximum length compatible with retention of coupling to the exogenous receptor. It 
is presently believed that only the final 10 or 20 amino acids at the carboxy terminus of the 



Got subunit are required for interaction with the receptor. 

GPA gAMHl hybrids . Kang et al.. supra, described hybrid G a subunits encoding the 
amino terminal 310 residues of GPA1 fused to the carboxyl terminal 160, 143 and 142 
residues, respectively, of GaS, Gai2, and Gao. In all cases examined by Kang et al., the 
hybrid proteins were able to complement the growth arrest phenotype of gpal strains. We 
have confirmed these findings and, in addition, have constructed and tested hybrids between 
GPA1 and Gai3, Gaq and Gal 6. All hybrids of this type that have been tested functionally 
complement the growth arrest phenotype of gpal strains. 

GPA41 hybrids. The rationale for constructing a minimal hybrid encoding only 41 
amino acids of GPA1 relies upon the biochemical evidence for the role of the ammo-terminus 
of Got subunits discussed above, together with the following observation. G p and Gy 
subunits are known to interact via a- helical domains at their respective amino-termini 
(Pronin, et al. (1992) Proc. Natl Acad. Sci. USA 89:6220-6224); Garritsen, et all 993). The 
suggestion that the amino termini of Ga subunits may form an helical coil and that this 
helical coil may be involved in association of Ga with G(Jy (Masters et al (1986) Protein 
Engineering 1:47-54); Lupas et al.(1992) FEBS Lett. 314:105-108) leads to the hypothesis 
that the three subunits of the G-protein heterotrimer interact with one another reversibly 
through the winding and unwinding of their amino-terminal helical regions. A mechanism of 
this type has been suggested, as well, from an analysis of leucine zipper mutants of the GCN4 
transcription factor (Harbury, et al. (1993) Science 262:1401-1407). The rationale for 
constructing hybrids like those described by Kang, et al. supra., that contain a majority of 
yeast sequence and only minimal mammalian sequence, derives from their ability to function 
in assays of coupling between Ga and GPy subunits. However, these chimeras had never 
been assayed for an ability to couple to both mammalian G protein-coupled receptors and 
yeast GPy subunits, and hence to reconstitute a hybrid signaling pathway in yeast; - 

GPA41 hybrids that have been constructed and tested include Gas, Gai2, Gai3, Gaq, 
Gao a , Gaob and Gal 6. Hybrids of Gas, Gai2, Gai3, and Gal6 functionally complement 
the growth arrest phenotype of gpal strains, while GPA41 hybrids of Gao a and Gaob do 
not. In addition to being tested in a growth arrest assay, these constructs have been assayed in 
the more sensitive transcriptional assay for activation of a fuslp-HIS3 gene. In both of these 
assays, the GPA4i-Gas hybrid couples less well than the GPa4i-i2, -i3, and -16 hybrids, 
while the GPa4j -o a , and -Ob hyrids do not function in either assay. 

Several predictive algorithms indicate that the amino terminal domain up to the 
highly conserved sequence motif-LLLLGAG^fb^ (&e Tirst L in this motif is residue 43 in 
GPAlj forms a helical structure with'amphipathic character. Assuming that a heptahelical 
repeat unit, the following hybrids between GPA1 and GaS can be used to define the number 
of helical repeats in this motif necessary for hybrid function: 



GPAl-7/Gas8-394 

GPA1-14/Gasl 5-394 

GPAl-21/Gas22-394 

GPAl-28/Gas29-394 

GPAl-35/Gas36-394 

GPAl-42/Gas43-394 

In these hybrids, the prediction is that the structural repeat unit in the amino terminal 
domain up to the tetra-leucine motif is 7, and that swapping sequences in units of 7 will in 
effect amount to a swap of unit turns of turns of the helical structure that comprises this 
domain. 

A second group of "double crossover'" hybrids of this class are those that are aligned 
oh the first putative heptad repeat beginning with residue Gil in GPA1. In these hybrids, 
helical repeats are swapped from GPA1 into a GaS backbone one heptad repeat unit at a time. 

GaS 1 -1 0/GPA1 1 -1 7/Gasl 8-394 

GaS 1 - 1 7/GPA 1 8-24/GaS25-3 94 

GaSl-17/GPA25-31/GaS32-394 

GaS?-17/GPA32-38/GaS39-394 

The gap that is introduced ^ etweer V^^gP^ 9 and j[0 ii^jfoejXjaS sequence is to 
preserve the alignment of the -LLLLGAG^^equen^ nwtft ^his class of hybrids can be 
complemented by cassette mutagenesis of each heptad repeat followed by screening of these 
collections of "heptad" libraries in standard coupling assays. 

A third class of hybrids based on the prediction that the amino terminus forms a 
helical domain with a heptahelical repeat unit are those that effect the overall hydrophobic or 
hydrophilic character of the opposing sides of the predicted helical structure (See Lupas et al. 
supra). In this model, the a and d positions of the heptad repeat abcdefg are found to be 
conserved hydrophobic residues that define one face of the helix, while the e and g positions 
define the charged face of the helix. In this class of hybrids, the sequence of the GaS parent 
is maintained except for specific substitutions at one or more of the following critical 
residues to render the different helical faces of GaS more "GPAl-like" 

K8Q 

+1-10 

ElOG 

Q12E 

R13S 

N14D 

E15P 
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E15F 

K17L- 

E21R 

K28Q 

K32L 

V36R 

. This collection of single mutations could be screened for coupling efficiency to yeast 
G0y and then constructed in combinations (double and greater if necessary). 

A fourth class of hybrid molecules that span this region of GPAl-Ga hybrids are 
those that have junctions between GPAl and Ga subunits introduced by three primer PCR. 
In this approach, the two outside primers are encoded by sequences at the initiator 
methionine of GPAl on the 5 1 side and at the tetraleucine motif of GaS (for example) on the 
3' side. A series of junctional primers spanning different junctional points can be mixed with 
the outside primers to make a series of molecules each with different amounts of GPAl and 
GaS sequences, respectively. 

GPA ip and GPA ^\y hybrids . The regions of high homology among Gpy subunits 
that have been identified by sequence alignment are interspersed throughout the molecule. 
The Gl region containing the highly conserved -GSGESGDST- A motif is followed 
immediately by a region of very low sequence consevation, the "iP or insert 1 region. Both 
sequence and length vary considerably among the il regions of the Ga subunits. By aligning 
the sequences of Ga subunits, the conserved regions bounding the il region were identified 
and two additional classes of GPAl-Ga hybrids were constructed. ^^^^^^Pj}^ 
encode the amino terminal 102 residues of GPAl (up to the sequence -QARKLGIQ^ fused 
in frame to mammalian Ga subunits, while the GPALW^^d^encg^e the amino terminal 
244 residues of GPAl (up to the sequence LIHfiDlAKA^in GPAl). The reason for 
constructing the GPAjd and GPAlw hybrids was to test the hypothesis that the il region of 
GPAl is required for mediating the interaction of GPAl with yeast Gpy subunits, for the 
stable expression of the hybrid molecules, or for function of the hybrid molecules. The 
GPAjj) hybrids contain the amino terminal domain of GPAl fused to the il domain of 
mammalian subunits, and therefore do not contain the GPAl il region, while the GPAlw 
hybrids contain the amino terminal 244 residues of GPAl including the entire il region (as 
defined by sequence alignments). Hybrids of both GPAjd and GPAlv/ classes were 
constructed for GaS, C-ai2, Gai3, Gao a , and Gal6; none of these hybrids complemented 
the gpal growth arrest phenotype. 

Subsequent to the construction and testing of the GPAjd and GPALW classes of 
hybrids, the crystal structures of Gt ra nsducin * n both the GDP and GTPyS-liganded form, and 
the crystal structure of several Gail variants in the GTPyS-liganded and GDP-AIF4 forms. 
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were reported (Noel et al. supra; Lambright et al. supra] and Coleman et al.(1994) Science 
265:1405-1412). The crystal structures reveal that the ilregion defined by sequence alignment 
has a conserved structure that is comprised of six alpha helices in a rigid array, and that the 
junctions chosen for the construction of the GPAid and GPAlw hybrids were not 
compatible with conservation of the structural features of the il region observed in the 
crystals. The junction chosen for the GPAjd hybrids falls in the center of the long <xA helix; 
chimerization of this helix in all likelihood destabilizes it and the protein structure in general. 
The same is true of the junction chosen for the GPAlw hybrids in which the crossover point 
between GPA1 and the mammalian Ga subunit falls at the end of the short ccC helix and 
. therefore may distort it and destabilize the protein. 

The failure of the GPAid and GPAlw hybrids is predicted to be due to disruption of 
critical structural elements in the il region as discussed above. Based upon new alignments 
and the data presented in Noel et al (supra), Lambright et al (supra), and Coleman et al 
(supra), this problem can be averted with the ras-like core domain and the il helical domain 
are introduced outside of known structural elements like alpha-helices. 

. Hybrid A GaSl-67/GPA66-299/GaS203-394 

This hybrid contains the entire il insert of GPA1 interposed into the 
GccS sequence. 

Hybrid B GPAl-41/GaS4443-67/GPA66-299/GaS203-394 

This hybrid contains the amino terminal 41 residues of GPA1 in 
place of the 42 amino terminal residues of GccS found in Hybrid A. 



Ga s Hybrids . There is evidence that the "switch region" encoded by residues 171-237 of Ga 
transducin (using the numbering of (Noel et al (supra) also plays a role in GPy coupling. 
First, the G226A mutation in GaS prevents the GTP-induced conformational change that 
occurs with exchange of GDP for GTP u^i^ecg)tor ^ctivation by ligand. This residue maps 
to the highly conservecTsequence -DVUGQ^ present in all Ga subunits and is involved in 
GTP hydrolysis. In both the Gat and Ga il crystal structures, this sequence motif resides in 
the loop that connects the 03 sheet and the a2 helix in the guanine nucleotide binding core. 
In addition to blocking the conformational change that occurs upon GTP binding, this 
mutation also prevents dissociation of GTP-liganded Gas from Gpy. Second, crosslinking 
data reveals that a highly conserved cysteine residue in the a2 helix (C215 in Gao, C210 in 
Gat) can be crosslinked to the carboxy terminal region of Gp subunits. Finally, genetic 
evidence (Whiteway et al. (1993) Mol Cell Biol. 14:3233-3239) identifies an important 
single residue in GPA1 (E307) in the P2 sheet of the core structure that may be in direct 
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contact with Py. A mutation in the GPA1 protein at this position suppresses the constitutive 
signalling phenotype of a variety of STE4 (GP) dominant negative mutations that are also 
known to be defective in Ga-GPy association (as assessed in two-hybrid assay in yeast as 
well as by more conventional genetic tests). 

We have tested the hypothesis that there are switch region determinants involved in 
the association of Got with Gpy by constructing a series of hybrid Ga proteins encoding 
portions of GPA1 and GaS in different combinations. 

Two conclusions may be drawn. First, in the context of the amino terminus of GaS, 
the GPA1 switch region suppresses coupling to yeast Gpy (SGS), while in the context of the 
GPA1 amino terminus the GPA1 switch region stabilizes coupling with Gpy (GPpy-SGS). 
This suggests that these two regions of GPA1 collaborate to allow interactions between Ga 
subunits and Gpy subunits. This conclusion is somewhat mitigated by the observation that 
the GPA4i-Gas hybrid that does not contain the GPA1 switch region is able to complement 
the growth arrest phenotype of gpal strains. We have not to date noted a quantitative 
difference between the behavior of the GPA^-Gas allele and the GPA-I-SGS allele, but if 
this interaction is somewhat degenerate, then it may be difficult to quantitate this accurately. 
The second conclusion that can be drawn from these results is that there are other 
determinants involved in stabilizing the interaction of Ga with GPy beyond these two regions 
as none of the GPAl/Gas hybrid proteins couple as efficiently to yeast Gpy as does native 
GPAL 

The role of the surface-exposed residues of this region may be crucial for effective 
coupling to yeast GPy, and can be incorporated into hybrid molecules as follows below. 

GaS-GPA-Switch GaS l-202/GPA298-350/GaS 253-394 
This hybrid encodes the entire switch region of GPA 1 in the context of GaS. 

GaS-GPA-a2 GQS 1-226/GPA322-332/GQS 238-394 

This hybrid encodes the a 2 helix of GPA1 in the context of GaS. 

GPA41-GaS-GPA-a2GPAl-41/GQS43-226/GPA322-332/GQS238-394 

This hybrid encodes the 41 residue amino terminal domain of GPA1 and the a2 helix 
of GPA1 in the context of GaS. 

Finally, the last class of hybrids that will be discussed here are those that alter the 
surface, exposed residues of the p2 and p3 sheets of aS so that they resemble those of the 
GPA1 QS helix. These altered a2 helical domains have the following structure. (The 



- positions of the altered residues correspond to GaS.) 
L203K 
'K211E 
D215G 
K216S 
D229S 

These single mutations can be engineered into a GaS backbone singly and in pairwise 
combinations. In addition, they can be introduced in the context of both the full length GaS 
and the GPA41-G0CS hybrid described previously. All are predicted to improve the coupling 
of Get subunits to yeast GPy subunits by virtue of improved electrostatic and hydrophobic 
contacts between this region and the regions of Gp defined by Whiteway and coworkers 
(Whiteway et al {supra) that define site(s) that interact with GPA1). 

In summary, the identification of hybrid Got subunits that couple to the yeast 
pheromone pathway has led to the following general observations. First, all GPAbaMHI 
hybrids associate with yeast GPy, therefore at a minimum these hybrids contain the 
determinants in GPA1 necessary for coupling to the pheromone response pathway. Second, 
the amino terminal 41 residues of GPA1 contain sufficient determinants to facilitate coupling 
of Ga hybrids to yeast Gpy in some, but not all, instances, and that some Ga subunits contain 
regions outside of the first 41 residues that are sufficiently similar to those in GPA1 to 
facilitate interaction with GPA1 even in the absence of the amino terminal 41 residues of 
GPA1 . Third, there are other determinants in the first 310 residues of GPA1 that are involved 
in coupling Ga subunits to yeast GPy subunits. 

The various classes of hybrids noted above are not mutually exclusive. For example, a 
GPA1 containing GPA1 -41 could also feature the L203K mutation. 

While, for the sake of simplicity, we have described hybrids of yeast GPA1 and a 
mammalian Gas, it will be appreciated that hybrids may be made of other yeast Ga subunits 
and/or other mammalian Ga subunits, notably mammalian Gai subunits. Moreover, while 
the described hybrids are constructed from two parental proteins, hybrids of three or more 
parental proteins are also possible. 

As shown in the Examples, chimeric Ga subunits have been especially useful in 
coupling receptors to Gai species. 

G. Expression of Go. 

Kang et al. supra reported that several classes of native mammalian G~ subunits were 
able to interact functionally with yeast a subunits when expression of Ga was driven from a 
constitutively active/strong promoter (PGK) or from a strong inducible promoter (CUP). 
These authors reported that rat GaS, Gai2 or Gao expressed at high level coupled to yeast 
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Py. High level expression of mammalian Ga (i.e. non-stoichiometric with respect to yeast Py) 
is not desirable for uses like those described in this application. Reconstruction of G protein- 
coupled receptor signal transduction in yeast requires the signalling component of the 
heterotrimeric complex (Gpy) to be present stoichiometrically with Ga subunits. An excess 
of Ga subunits (as was required for coupling of mammalian Gai2 and Gao to yeast GPy in 
Kang et al.) would dampen the signal in systems where GPy subunits transduce the signal. An 
excess of Ga subunits raises the background level of signaling in the system to unacceptably 
high levels. Preferably, levels of Ga and Gpy subunits are balanced. For example, 
heterologous Ga subunits may be expressed from a low copy (CEN ARS) vector containing . 
.the endogenous yeast GPA1 promoter and the GPA1 3' untranslated region. The minimum 
criterion, applied to a heterologous Gasubunit with respect to its ability to couple 
functionally to the yeast pheromone pathway, is that it complement a gpal genotype when 
expressed from the GPA1 promoter on low copy plasmids or from an integrated, single copy 
. gene. In the work described in this application, all heterologous Ga subunits have been 
assayed in two biological systems. In the first assay heterologous Ga subunits are tested for 
an ability to functionally complement the growth arrest phenotype of gpal strains. In the 
second assay the transcription of a fiisl-HIS3 reporter gene is used to measure the extent to 
which the pheromone response pathway is activated, and hence the extent to which the 
heterologous Ga subunit sequesters the endogenous yeast GPy complex. Mammalian Gas, 
Gai2, Gai3, Gaq, Gal 1, Gal 6, Gao a , Gao^, and Gaz from rat, murine or human origins 
were expressed from a low copy, CEN ARS vector containing the GPA1 promoter. 
Functional complementation of gpa/ strains was not observed in either assay system with any 
of these full-length Ga constructs with the exception of rat and human GaS. 

K Chimeric Yeast Py subunits 

An alternative to the modification of a mammalian Ga subunit for improved signal 
transduction is the modification of the pertinent sites in the yeast Gp or Gy subunits. The 
principles discussed already with respect to Ga subunits apply, mutatis mutandis, to yeast Gp 
or Gy. - — - 

For example, it would not be unreasonable to target the yeast Ste4p Gpsubunit with 
cassette mutagenesis. Specifically, the region of Ste4p that encodes several of the dominant 
negative, signaling-defective mutations would be an excellent target for cassette mutagenesis 
when looking for coupling of yeast GPy to specific mammalian Ga subunits. 

X. Peptide Libraries 

While others have engineered yeast cells to facilitate screening of exogenous drugs as 



receptor agonists and antagonists, the cells did not themselves produce both the drugs and the 
receptors. Yeast cells engineered to produce the receptor, but that do not produce the drugs 
themselves, are inefficient. To utilize them one must bring a sufficient concentration of each 
drug into contact with a number of cells in order to detect whether or not the drug has an 
action. Therefore, a microtiter plate well or test tube must be. used for each drug. The drug 
must be synthesized in advance and be sufficiently pure to judge its action on the yeast cells. 
When the yeast cell produces the drug, the effective concentration is higher. 

Peptide libraries are systems which simultaneously display, in a form which permits 
interaction with a target, a highly diverse and numerous collection of peptides. These 
peptides may be presented in solution (Houghten (1992) Biotechniques 13:412-421), or on 
beads (Lam (1991) Nature 354:82-84), chips (Fodor (1993) Nature 364:555-556), bacteria 
(Ladner USP 5,223,409), spores (Ladner USP '409), plasmids (Cull et al. (1992) Proc Natl 
Acad Sci USA 89:1865-1869) or on phage (Scott and Smith (1990) Science 249:386-390); 
(Devlin (\990)Science 249:404-406); (Cwirla et al. (1990) Proc. Natl Acad ScL 87:6378- 
6382); (Felici (1991) J. Mol Biol 222:301-310); (Ladner supra.). Many of these systems are 
limited in terms of the maximum length of the peptide or the composition of the peptide (e.g., 
Cys excluded). Steric factors, such as the proximity of a support, may interfere with binding. 
Usually, the screening is for binding in vitro to an artificially presented target, not for 
activation or inhibition of a cellular signal transduction pathway in a living cell. While a cell 
surface receptor may be used as a target, the screening will not reveal whether the binding of 
the peptide caused an allosteric change in the conformation of the receptor. 

The Ladner et al. patent, USSN 5,096,815, describes a method of identifying novel 
proteins or polypeptides with a desired DNA binding activity. Semi-random ("variegated") 
DNA encoding a large number of different potential binding proteins is introduced, in 
expressible form, into suitable host cells. The target DNA sequence is incorporated into a 
genetically engineered operon such that the binding of the protein or polypeptide will prevent 
expression of a gene product that is deleterious to the gene under selective conditions. Cells 
which survive the selective conditions are thus cells which express a protein which binds the 
target DNA. While it is taught that yeast cells may be used for testing, bacterial cells are 
preferred. The interactions between the protein and the target DNA occur only in the cell 
(and then only in the nucleus), not in the periplasm or cytoplasm, and the target is a nucleic 
acid, and not a receptor protein. Substitution of random peptide sequences for functional 
domains in cellular proteins permits some determination of the specific sequence 
requirements for the accomplishment of function. Though the details of the recognition 
phenomena which operate in the localization of proteins within cells remain largely 
unknown, the constraints on sequence variation of mitochondrial targeting sequences and 
protein secretion signal sequences have been elucidated using random peptides (Lemire et al., 
J. Biol Chem.{\9%9) 264, 20206 and Kaiser et al. (1987) Science 235:312, respectively). 



-44- 



The peptide library of the present invention takes the form of a cell culture, in which 
essentially each cell expresses one, and usually only one, peptide of the library. While the 
diversity of the library is maximized if each cell produces a peptide of a different sequence, it 
is usually prudent to construct the library so there is some redundancy. Depending on size, 
the combinatorial peptides of the library can be expressed as is, or can be incorporated into 
larger fusion proteins. The fusion protein can provide, for example, stability against 
degradation or denaturation, as well as a secretion signal if secreted. In an exemplary 
embodiment of a library for intracellular expression, e.g., for use in conjunction with 
intracellular target receptors, the polypeptide library is expressed as thioredoxin fusion 
proteins (see, for example, U.S. Patents 5,270,181 and 5,292,646; and PCT publication 
W094/ 02502). The combinatorial peptide can be attached one the terminus of the 
thioredoxin protein, or, for short peptide libraries, inserted into the so-called active loop. 

In one embodiment, the peptide library is derived to express a combinatorial library of 
polypeptides which are not based on any known sequence, nor derived from cDNA. That is, 
the sequences of the library are largely random. In preferred embodiments, the combinatorial 
polypeptides are in the range of 3-100 amino acids in length, more preferably at least 5-50, 
and even more preferably at least 10, 13, 15, 20 or 25 amino acid residues in length. 
Preferably, the polypeptides of the library are of uniform length. It will be understood that 
the length of the combinatorial peptide does not reflect any extraneous sequences which may 
be present in order to facilitate expression, e.g., such as signal sequences or invariant portions 
of a fusion protein. 

In another embodiment, the peptide library is derived to express a combinatorial 
library of polypeptides which are based at least in part on a known polypeptide sequence or a 
portion thereof (not a cDNA library). That is, the sequences of the library is semi-random, 
being derived by combinatorial mutagenesis of a known sequence. See, for example, Ladner 
et al. PCT publication WO 90/02909; Garrard et al., J?CT publication WO 92/09690; Marks 
et al. (1992) 1 Biol Chem. 267:16007-16010; Griffths et al. (1993) EMBO J 12:725-734; 
Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461. 
Accordingly, polypeptide(s) which are known ligands for a target receptor can be 
mutagenized by standard techniques to derive a variegated library of polypeptide sequences 
which can further be screened for agonists and/or antagonists. For example, the surrogate 
ligand identified for FPRL-1, e.g., the Ser-Leu-Leu-Trp-Leu-Thr-Cys-Arg-Pro-Trp-Glu-Ala- 
Met peptide, can be mutagenized to generate a library of peptides with some relationship to 
the original tridecapeptide. This library can be expressed in a reagent cell of the present 
invention, and other receptor activators can be isolated from the library. This may permit the 
identification of even more potent FPRL-1 surrogate ligands. 

Alternatively, the library can be expressed under conditions wherein the cells are in 
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contact with the original tridecapeptide, e.g., the FPRL-1 receptor is being induced by that 
surrogate ligand. Peptides from an expressed library can be isolated based on their ability to 
potentiate the induction, or to inhibit the induction, caused by the surrogate ligand. The latter 
of course will identify potential antagonists of chemoattractant receptors. In still other 
(embodiments, the surrogate ligand can be used to screen exogenous compound libraries 
(peptide and non-peptide) which, by modulating the activity of the identified surrogate, will 
presumably also similarly effect the native ligand's effect on the target receptor: In such 
embodiments, the surrogate ligand can be applied to the cells, though is preferably produced 
by the reagent cell, thereby providing an autocrine cell. 

In still another embodiment, the combinatorial polypeptides are produced from a 
cDNA library. 

In a preferred embodiment of the present invention, the yeast cells collectively 
produce a "peptide library", preferably including at least 10 3 to 10 7 different peptides, so that 
diverse peptides may be simultaneously assayed for the ability to interact with the exogenous 
receptor. In an especially preferred embodiment, at least some peptides of the peptide library 
are secreted into the periplasm, where they may interact with the "extracellular" binding 
site(s) of an exogenous receptor. They thus mimic more closely the clinical interaction of 
drugs with cellular receptors. This embodiment optionally may be further improved (in 
assays not requiring pheromone secretion) by preventing pheromone secretion, and thereby 
avoiding competition between the peptide and the pheromone for signal peptidase and other 
components of the secretion system. 

In the present invention, the peptides of the library are encoded by a mixture of DNA 
molecules of different sequence. Each peptide-encoding DNA molecule is ligated with a, 
vector DNA molecule and the resulting recombinant DNA molecule is introduced into a host 
cell. Since it is a matter of chance which peptide encoding DNA molecule is introduced into 
a particular cell, it is not predictable which peptide that cell will produce. However, based on 
a knowledge of the manner in which the mixture was prepared, one may make certain 
statistical predictions about the mixture of peptides in the peptide library. 

It is convenient to speak of the peptides of the library as being composed of constant 
and variable residues. Iftfie nth residue is the same for all peptides of theJibrary, it is said to 
be constant. If the nth residue varies, depending on the peptide in question, the residue is a 
variable one. The peptides of the library will have at least one, and usually more than one, 
variable residue. A variable residue may vary among any of two to all twenty of the 
genetically encoded amino acids; the variable residues of the peptide may vary in the same or 
different manner. Moreover, the frequency of occurrence of the allowed amino acids at a 
particular residue position may be the same or different. The peptide may also have one or 
more constant residues. 



There are two principal ways in which to prepare the required DNA mixture. In one 
method, the DNAs are synthesized a base at a time. When variation is desired, at a base 
position dictated by the Genetic Code, a suitable mixture of nucleotides is reacted with the 
nascent DNA, rather than the pure nucleotide reagent of conventional . polynucleotide 
synthesis. 

The second method provides more exact control over the amino acid variation. First, 
trinucleotide reagents are prepared, each trinucleotide being a codon of one (and only one) of 
the amino acids to be featured in the peptide library. When a particular variable residue is to 
be synthesized, a mixture is made of the appropriate trinucleotides and reacted with the 
nascent DNA. Once the necessary "degenerate" DNA is complete, it must be joined with the 
DNA sequences necessary to assure the expression of the peptide, as discussed in more detail 
below, and the complete DNA construct must be introduced into the yeast cell 

XL Screening and Selection: Assays of Second Messenger Generation 

When screening for bioactivity of peptides, intracellular second messenger generation 
can be measured directly. A variety of intracellular effectors have been identified as being G- 
protein-regulated, including adenylyl cyclase, cyclic GMP, phosphodiesterases, 
phosphoinositidase C, and phospholipase A2- In addition, G proteins interact with a range of 
ion channels and are able to inhibit certain voltage-sensitive Ca" 1-4 " transients, as well as 
stimulating cardiac K + channels. 

In one embodiment, the GTPase enzymatic activity by G proteins can be measured in 
plasma membrane preparations by determining the breakdown of y32p GTP using techniques 
that are known in the art (For example, see Signal Transduction: A Practical Approach. G. 
Milligan, Ed. Oxford University Press, Oxford England), When receptors that modulate 
cAMP are tested, it will be possible to use standard techniques for cAMP detection, such as 
competitive assays which quantitate [ 3 H]cAMP in the presence of unlabelled cAMP. 

Certain receptors stimulate the activity of phospholipase C which stimulates the 
breakdown of phosphatidylinositol 4,5, bisphosphate to 1,4,5-IP3 (which mobilizes 
intracellular Ca++) and diacylglycerol (DAG) (which activates protein kinase C). Inositol 
lipids can be extracted and analyzed using standard lipid extraction techniques. DAG can 
also be measured using thin-layer chromatography. Water soluble derivatives of all three 
inositol lipids (IP1, IP2, IP3) can also be quantitated using radiolabelling techniques or 
HPLC! 

The mobilization of intracellular calcium or the influx of calcium from outside the 
cell can be measured using standard techniques. The choice of the appropriate calcium 
indicator, fluorescent, bioluminescent, metallochromic, or Ca-H--sensitive microelectrodes 
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depends on the cell type and the magnitude and time constant of the event under study (Borle 
(1990) Environ Health Per sped 84:45-56). As an exemplary method of Ca-H- detection, 
cells could be loaded with the Ca++sensitive fluorescent dye fura-2 or indo-1, using standard 
methods, and any change in Ca-H- measured using a fluorometer. 

The other product of PIP2 breakdown, DAG can also be produced from phosphatidyl 
choline. The breakdown of this phospholipid in response to receptor-mediated signaling can 
also be measured using a variety of radiolabelling techniques. 

The activation of phospholipase A2 can easily be quantitated using known 
techniques, including, for example, the generation of arachadonate in the cell. 

In the case of certain receptors, it may be desirable to screen for changes in cellular 
phosphorylation. Such assay formats may be useful when the receptor of interest is a 
receptor tyrosine kinase. For example, yeast transformed with the FGF receptor and a ligand 
which binds the FGF receptor could be screened using colony immunoblotting (Lyons and 
Nelson (1984) Proc. Natl Acad Sci. USA 81:7426-7430) using anti-phosphotyrosine. In 
addition, tests for phosphorylation could be useful when a receptor which may not itself be a 
tyrosine kinase, activates protein kinases that function downstream in the signal transduction 
pathway. Likwise, it is noted that protein phosphorylation also plays a critical role in 
cascades that serve to amplify signals generated at the receptor. Multi-kinase cascades allow 
not only signal amplification but also signal divergence to multiple effectors that are often 
cell-type specific, allowing a growth factor to stimulate mitosis of one cell and differentiation 
of another. ^ 

One such cascade is the MAP kinase pathway that appears to mediate both mitogenic, 
differentiation and stress responses in different cell types. Stimulation of growth factor 
receptors results inHas activation followed by the sequential activation of c-Raf, MEK, and 
p44 and p42 MAP kinases (ERK1 and ERK2). Activated MAP kinase then phosphorylates 
many key regulatory proteins, including p90RSK and Elk-1 that are phosphorylated when 
MAP kinase translocates to the nucleus. Homologous pathways exist in mammalian and 
yeast cells. For instance, an essential part of theJS. cerevisiae pheromone signaling pathway 
is comprised of a protein kinase cascade composed of the products of the STE1 1, STE7, and 
FUS3/KSS1 senes (the latter pair are distinct and functionally redundant). Accordingly, 
phosphorylation and/or activation of members of this kinase cascade can be detected and 
used to quantitate receptor engagement. Phosphotyrosine specific antibodies are available to 
measure increases in tyrosine phosphorylation and phospho-specific antibodies are 
commercially available (New England Biolabs, Beverly, MA). 

Modified methods for detecting receptor-mediated signal transduction exist and one 
of skill in the art will recognize suitable methods that may be used to substitute for the 
example methods listed. 
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XII Screening and Selection Using Reporter Gene Constructs 

In addition to measuring second messenger production, reporter gene constructs can 
be used. Reporter gene constructs are prepared by operatively linking a reporter gene with at 
least one transcriptional regulatory element. If only one transcriptional regulatory element is 
included it must be a regulatable promoter, At least one the selected transcriptional regulatory 
elements must be indirectly or directly regulated by the activity of the selected cell-surface 
receptor whereby activity of the receptor can be monitored via transcription of the reporter 
genes. 

The construct may contain additional transcriptional regulatory elements, such as a 
FIRE sequence, or other sequence, that is not necessarily regulated by the cell surface protein, 
but is selected for its ability to reduce background level transcription or to amplify the 
transduced signal and to thereby increase the sensitivity and reliability of the assay. 

Many reporter genes and transcriptional regulatory elements are known to those of . 
skill in the art and others may be identified or synthesized by methods known to those of skill 
in the art. Reporter genes 

A reporter gene includes any gene that expresses a detectable gene product, which 
may be RNA or protein. Preferred reporter genes are those that are readily detectable. The 
reporter gene may also be included in the construct in the form of a fusion gene with a gene 
that includes desired transcriptional regulatory sequences or exhibits other desirable 
properties. 

Examples of reporter genes include, but are not limited to CAT (chloramphenicol 
acetyl transferase) (Alton and Vapnek (1979), Nature 282: 864-869) luciferase, and other 
enzyme detection systems, such as beta-galactosidase; firefly luciferase (deWet et al. (1987), 
Mol. Cell. Biol. 7:725-737); bacterial luciferase (Engebrecht and Silverman (1984), PNAS 1: 
4154-4158; Baldwin et al. (1984), Biochemistry 23: 3663-3667); alkaline phosphatase (Toh 
et al. (1989) Eur. J. Biochem. 182: 231-238, Hall et al. (1983) J. Mol. Appl. Gen. 2: 101), 
human placental secreted alkaline phosphatase (Cullen and Malim (1992) Methods in 
- Enzymol. 216:362-368). — - 

Transcriptional control elements include, but are not limited to, promoters, enhancers, 
and repressor and activator binding sites. Suitable transcriptional regulatory elements may be 
derived from the transcriptional regulatory regions of genes whose expression is rapidly 
induced, generally within minutes, of contact between the cell surface protein and the effector 
protein that modulates the activity of the cell surface protein. Examples of such genes 
include, but are not limited to, the immediate early genes (see, Sheng et al. (1990) Neuron 4: 
477-485), such as c-fos, Immediate early genes are genes that are rapidly induced upon 
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binding of a ligand to a cell surface protein. The transcriptional control elements that are 
preferred for use in the gene constructs include transcriptional control elements from 
immediate early genes, elements derived from other genes that exhibit some or all of the 
characteristics of the immediate early genes, or synthetic elements that are constructed such 
that genes in operative linkage therewith exhibit such characteristics. The characteristics of 
preferred genes from which the transcriptional control elements are derived include, but are 
not limited to, low or undetectable expression in quiescent cells, rapid induction at the 
transcriptional level within minutes of extracellular simulation, induction that is transient and 
independent of new protein synthesis, subsequent shut-off of transcription requires new 
protein synthesis, and mRNAs transcribed from these genes have a short half-life. It is not 
necessary for all of these properties to be present. 

In the most preferred constructs, the transcriptional regulatory elements are derived 
from the c-fos gene. 

The c-fos proto oncogene is the cellular homolog of the transforming gene of FBJ 
osteosarcoma virus. It encodes a nuclear protein that most likely involved in normal cellular 
growth and differentiation. Transcription of c-fos is transiently and rapidly activated by 
growth factors and by other inducers of other cell surface proteins, including hormones, 
differentiation-specific agents, stress, mitogens and other known inducers of cell surface 
proteins. Activation is protein synthesis independent. The c-fos regulatory elements include 
(see, Verma et al. (1987) Cell 51: a TATA box that is required for transcription initiation; two 
upstream elements for basal transcription, and an enhancer, which includes an element with 
dyad symmetry and which is required for induction by TPA, serum, EGF, and PMA. 

The~20 bp transcriptional enhancer element located between - 317 and - 298 bp 
upstream from the c-fos mRNA cap site, which is essential for serum induction in serum 
starved NIH 3T3 cells. One of the two upstream elements is located at -63-57 and it 
resembles the consensus sequence for cAMP regulation. 

Other promoters and transcriptional control elements, in addition to those described 
above, include the vasoactive intestinal peptide (VIP) gene promoter (cAMP responsive; Fink 
et al. (1988), Proc. Natl. Acad. Sci. 85:6662-6666); the somatostatin gene promoter (cAMP 
responsive; Montminy et al. (1986), Proc. Natl. Acad. Sci. 8.3:6682-6686); the proenkephalin 
promoter (responsive to cAMP, nicotinic agonists, and phorbol esters; Comb et al. (1986), 
Nature 323:353-356); the phosphoenolpyruvate carboxy-kinase gene promoter (cAMP 
responsive; Short et al. (1986), J. Biol. Chem. 261:9721-9726); the NGFI-A gene promoter 
(responsive to NGF, cAMP, and serum; Changelian et al. (1989). Proc. Natl. Acad. Sci. 
86:377-381); and others that may be known to or prepared by those of skill in the art. 

In certain assays it may be desirable to use changes in growth in the screening 
procedure. For example, one of the consequences of activation of the pheromone signal 
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6 polypeptide^yj^jc^ ina^jthemselves be either agonistic or antagonistic of the S-L-L-W-L-T- 
Q C-R-P-W-E-A-M^epticie. Thus, using such mutagenic techniques as known in the art, the 
£ determinants of S-L-L-W-L-T-C-R-P-W-E-A^^ofy^tifle which participate in FPRL-1 
interactions can be ellucidated. To illustrate, the critical residues of a subject polypeptide 
which are involved in molecular recognition of an FPRL-1 receptor can be determined and 
0 used to generate variant ]^^^ t ^ s i ^ ch competitively inhibit binding of the authentic S- 
g L-L-W-L-T-C-R-P-W-E-A-M A peptide with that receptor. By employing, for example, 
scanning mutagenesis to map the amino acid residues of the polypeptide involved in binding 
the FPRL-1 receptor, peptide and peptidomimetic compounds can be generated which mimic 
. those residues in binding to the receptor and which consequently can inhibit binding of an 
. authentic ligand for the FPRL-1 receptor and interfere with the function of that receptor. 

Moreover, as is a PP^ en y^^^ f M esen t and parent disclosures, mimetopes of the 
subject S-L-L-W-L-T-C-R-P-W-E^-\^peptide can be provided as non-hydrolyzable peptide 
analogs. For illustrative purposes, peptide analogs of the present invention can be generated 
using, for example, benzodiazepines (e.g., see Freidinger et al. in Peptides: Chemistry and 
Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted 
gama lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., 
ESCOM Publisher: Leiden, Netherlands, 1988, pl23), C-7 mimics (Huffman et al. in 
Peptides: Chemistry and Biologyy, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988, p. 105), keto-methylene pseudopeptides (Ewenson et al. (1986) J Med 
Chem 29:295; and Ewenson et al. in Peptides: Structure and Function (Proceedings of the 9th 
American Peptide Symposium) Pierce Chemical Co. Rockland, IL, 1985), P-turn dipeptide 
cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem Soc Perkin 
Trans 1:1231), p-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res 
Communl26:419; and Dann et al. (1986) Biochem Biophys Res Commun 134:71), 
diaminoketones (Natarajan et al. (1984) Biochem Biophys Res Commun 124:141), and 
methyleneamino-modifed (Roark et al. in Peptides: Chemistry and Biology, G.R. Marshall 
ed., ESCOM Publisher: Leiden, Netherlands, 1988, pi 34). Also, see generally, Session III: 
Analytic and synthetic methods, in in Peptides: Chemistry and Biology, G.R. Marshall ed., 
ESCOM Publisher: Leiden, Netherlands, 1988) 

In an exemplary embodiment, the peptidomimetic can be derived as a retro-inverso 
analog of the peptide. To illustrate, the S-L-L-W-L-T-C-R-P-W-E-A-M^ptrae can be 
generated as the retro-inverso analog: 
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Such retro-inverso analogs can be made according to the methods known in the art, 
such as that described by the Sisto et al. U.S. Patent 4,522,752. For example, the illustrated 
retro-inverso analog can be generated as follows. The geminal diamine corresponding to the 
serine analog is synthesized by treating a protected serine with ammonia under HOBT-DCC 
coupling conditions to yield the N-Boc amide, and then effecting a Hofmann-type 
rearrangement with I,I-bis-(trifluoroacetoxy)iodobenzene (TIB), as described in 
Radhakrishna et al. (1979) J. Qrg. Chem. 44:1746. The product amine salt is then coupled to 
a side-chain protected (e.g., as the benzyl ester) N-Fmoc D-Leu residue under standard 
conditions ' to yield the pseudodipeptide. The Fmoc (fluorenylmethoxycarbonyl) group is 
removed with piperidine in dimethylformamide, and the resulting amine is trimethylsilylated 
with bistrimethylsilylacetamide (BSA) before condensation with suitably alkylated, side- 
chain protected derivative of Meldrum ! s acid, as described in U.S. Patent 5,061,811 to Pinori 
et al., to yield the retro-inverso tripeptide analog S-L-L. The pseudotripeptide is then 
coupled with L-Trp under standard conditions to give the protected tetrapeptide analog. The 
protecting groups are removed to release the product, and the steps repeated to enlogate the 
tetrapeptide to the full length peptide. It will be understood that a mixed peptide, e.g. 
including some normal peptide linkages, can be generated. As a general guide, sites which 
are most susceptible to proteolysis are typically altered, with less susceptible amide linkages 
being optional for mimetic switching The final product, or intermediates thereof, can be 
purified by HPLC. 

In another illustrative embodiment, the peptidomimetic can be derived as a retro- 
enatio analog of the peptide, such as the e^mDlary retro-enatio peptide analog derived for 
the illustrative S-L-L- W-L-T-C-R-P-W-E-A-M^peptide: 
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NH 3 -(d) Met-(d) Ala-(d) Glu-(d)Trp.... (d) Trp- (d) Leu-(d)-Leu-(d) Ser 

Retro-enantio analogs such as this can be synthesized using commercially available 
D-amino acids and standard solid- or solution-phase peptide-synthesis techniques. For 
example, in a preferred solid-phase synthesis method, a suitably amino-protected (t- 
butyloxycarbonyl, Boc) D-Serine residue (or analog thereof) is covalently bound to a solid 
support such as chloromethyl resin. The resin is washed with dichloromethane (DCM), and 
the BOC protecting group removed by treatment with TFA in DCM. The resin is washed and 
neutralized, and the next Boc-protected D-amino acid (D-Leu) is introduced by coupling with 
diisopropylcarbodiimide. The resin is again washed, and the cycle repeated for each of the 
remaining amino acids in turn (D-Leu, D-Trp etc). When synthesis of the protected retro- 
enantio peptide is complete, the protecting groups are removed and the peptide cleaved from 
the solid support by treatment with hydrofluoric acid/anisole/dimethyl sulfide/thioanisole. 
The final product is purified by HPLC to yield the pure retro-enantio analog. 

In still another illustrative embodiment, trans-olefin derivatives can be made for the 




The trans olefin analog of the subject peptide can be synthesized according to the method of 
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Y.K. Shue et al. (1987) Tetrahedron Letters 28:3225. 

Still another class of peptidomimetic derivatives include the pho§phonate derivatives, 
such as the partially phosphbnate derivatived S-L-L-W-L-T-C-R-P-W-E-A-M peptide: 




The synthesis of such phosphonate derivatives can be adapted from known synthesis 
schemes. See, for example, Loots et al. in Peptides: Chemistry and Biology, (Escom Science 
Publishers, Leiden, 1988, p. 118); Petrillo et al. in Peptides: Structure and Function 
(Proceedings of the 9th American Peptide Symposium, Pierce Chemical Co. Rockland, IL, 
1985). 

XV. Novel C5a lizands 

Still another aspect of the invention pertains to a novel ligand for the C5a receptor. 

As described in Example 12, several 13-mer and 1 1-mer peptides have been identified from a 

polypeptide library on the basis of their ability to act as surrogate ligands for the C5a 

receptor. The sequence for exemplary C5a receptor ligands is provided in Figure 7. Yet 

another preferred C5a ligand^^h^(fes v or a portion of the peptide Asp-Thr-Arg-Ser-Trp- 

Lys-Leu-Arg-Leu-Leu-Trp-Eeu-AJa, described in the appended examples. 

A 

The importance of the C5a receptor finds its origin in its relationship with 
complement derived C5a and. its role in the overall immune response. In man, and in most 
animals, the complement system is composed of a group of proteins that are normally present 
in serum in an inactive state. When activated, these proteins participate in a coordinated 
series of reactions. Activation of the complement system results in enzymatic cleavage of 
complement proteins producing subfragments which possess a wide range of biologic 
activities required for host defense, including bloodclotting and inflammatory responses, as 
well as activation of immune response directed to the elimination of invading 



-59- 



microorganisms. During an inflammatory process, local production of complement-derived 
mediators result in increased vascular permeability, leukocyte adherence to endothelial and 
vascular tissue, and a chemotactic gradient that induces neutrophil (PMN) migration into the 
inflammatory site. In addition to beneficial aspects of the inflammatory process, systemic 
and/or chronic inflammatory processes have been associated with a variety of immune 
disease states. The anaphylatoxin C5a is one of the best described and most potent 
proinflammatory mediators derived from the complement system. C5a has been shown to be 
spasmogenic (Stimler et al. (1981) J. Immunol. 126:2258), chemotactic (Hugh et al. (1978) 
Adv. Immunol. 26:1), to increase vascular permeability (Shin et al. (1968) Science 162:361), 
and to induce the release of pharmacologically active mediators from numerous cell types 
(Grant et al. (1975) J. Immunol. 114:1101; Goldstein et al. (1973) J. Immunol. 113:1583; 
Schorlemmer et al. (1976) Nature 261:48). Most recently, C5a has been shown to directly or 
indirectly induce cytokine release from macrophages and to augment humoral- and cell- 
mediated immune responses in vitro. Combined, these studies indicate that C5a possesses 
multiple biologic activities important in host defense and may also play a role in 
inflammatory disease processes. Many cell types possess receptors for C5a, including PMNs, 
macrophages, mast cells and platelets. 

Among the various cell types, the neutrophil response to C5a is the best defined. Cell 
surface receptors specific for C5a have been demonstrated on the neutrophil (Chenoweth et 
al. (1978) PNAS 75:3943; Huey et al. (1985) J. Immunol. 135:2063; Rollins-et al. (1985) J. 
Biol. Chem. 260:7157), and the ligand-receptor interaction has been shown to promote human 
polymorphonuclear leukocyte (PMN) migration in a directed fashion (chemotaxis), 
adherence, oxidative burst, and granular enzyme release from these cells (Hugh et al. (1984) 
Springer Semin. Immunopathol. 7:193). The interaction of C5a with PMN and other target 
cells and tissues' results in increased histamine release, vascular permeability, smooth muscle 
contraction, and an influx into tissues of inflammatory cells, including neutrophils, 
eosinophils, and basophils (Hugh et al., supra). C5a may also be important in mediating 
inflammatory effects of phagocytic mononuclear - cells that accumulate at sites of chronic 
inflammation (Allison et al. (1978) Agents and Actions 8:27). C5a and C5a des-Arg can 
induce chemotaxis in monocytes (Ward et al. (1968) J. Exp. Med. 128:1201. Snyderman et al. 
(1979) J. Immunol. 109:896) and cause them to release lysosomal enzymes in a manner 
analogous to the neutrophil responses elicited by these agents. Other studies suggest that C5a 
may have an immunoregulatory role by enhancing antibody particularly at sites of 
inflammation (Morgan et al. (1982) 7. Exp. Med. 155:1412; Weigle et al. (1982) Federation 
Proc. 41 :3099; and Morgan et al. (1984) Federation Proc. 43:2543). 

Accordingly, the peptides identified by the instant assay as C5a ligands can be used 
therapeutically to enhance inflammatory responses. As above, the term "peptide" is used 
herein to refer to a chain of two or more amino acids or amino acid analogs (including non- 
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naturally occurring amino acids), with adjacent amino acids joined by peptide (-NHCO-) 
bonds. Thus, the peptides of the present invention include oligopeptides, polypeptides, and 
proteins. The peptide (or peptidomimetic) is preferably at least 3 amino acid residues in 
length, though peptides of any length up to 13, including peptides of 4, 5, 7, 10, 13 or more 
residues in length are preferred. Longer peptides are also specifically contemplated. For 
example, the sequence derived from the C5a surrogate ligand can be provided as part of a 
fusion protein. The minimum peptide length is chiefly dictated by the .need to obtain 
sufficient potency and selectivity as an activator or inhibitor. Given the size of the peptide 
isolated in subject assay, smaller fragments of the 11-mer and 13-mer peptides which retain 
C5a receptor binding activity will be easily identified, e.g., by chemical synthesis of different 
fragments. The maximum peptide length will only be a function of synthetic convenience 
once an active peptide is identified. 

The invention also provides for the generation of mimetics, e.g. peptide or non- 
peptide agents, of the subject C5a receptor ligands. Moreover, the present invention also 
contemplates variants of the subject C5a ligands which may themselves be either agonistic or 
' antagonistic of the C5a receptor activity. Thus, using such mutagenic techniques as known 
in the art, the determinants of peptide which participate in interaction with the C5a receptor 
can be ellucidated. To illustrate, the critical residues of a subject polypeptide which are 
involved in molecular recognition of a C5a receptor can be determined and used tQ generate 
variant polypeptides which competitively inhibit binding of the original peptide with that 
receptor. By employing, for example, scanning mutagenesis to map the amino acid residues 
of the polypeptide involved in binding the C5a receptor, peptide and peptidomimetic 
compounds can be generated which mimic those residues in binding to the receptor and 
which consequently can inhibit binding of an authentic ligand for the C5a receptor and 
interfere with the function of that receptor. Such C5a receptor antagonists can be useful as 
inhibitors of inflammation, e.g., in the treatment of anaphylaxis. 

Moreover, as is apparent from the present and parent disclosures, mimetopes of the 
subject C5a ligands can be provided as non-hydrolyzable peptide analogs. For illustrative 
purposes, peptide analogs of the present invention can be generated using, for example, 
- benzodiazepines (e.g., see-Freidinger et al. in Peptides: Chemistry and Biology, G.R. 
Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gama lactam rings 
(Garvey et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: 
Leiden, Netherlands, 1988, pi 23), C-7 mimics (Huffman et al. in Peptides: Chemistry and 
■ Biologyy, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988, p. 105), keto- 
methylene pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in 
Peptides: Structure and Function (Proceedings of the 9th American Peptide Symposium) 
Pierce Chemical Co. Rockland, IL, 1985), p-turn dipeptide cores (Nagai et al. (1985) 
Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem Soc Perkin Trans 1:1231), P- 
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aminoalcohols (Gordon et al. (1985) Biochem Biophys Res Communl26:419; and Dann et 
al. (1986) Biochem Biophys Res Commun 134:71), diaminoketones (Natarajan et al. (1984) 
Biochem Biophys Res Commun 124:141), and methyleneamino-modifed (Roark et al. in 
Peptides: Chemistry and Biology, G.R. Marshall, ed., ESCOM Publisher: Leiden, 
Netherlands, 1988, pl34). Also, see generally, Session III: Analytic and synthetic methods, 
in in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, 
Netherlands, 1988). 

XVL Further manipulation of peptide ligands . 

The above examples provide guidance for a variety of techniques for manipulating 
peptide ligands indentified in the present screening assay in order to develop more specific 
and/or potent agonists or antagonists. In addition, a variety of combinatorial techniques are 
known in the art and will be useful for further optimization of the peptide leads coming of the 
instant assay. For example, alanine scanning mutagenesis and the like (Lowman et al. 
(1991) Biochemistry 30:10832-10838; and Cunningham et al. (1989) Science 244:1081-1085), 
by linker scanning mutagenesis (Brown et al. (1992) Mol Cell Biol 12:2644-2652; McKnight 
et al. (1982) Science 232:316); by saturation mutagenesis (Meyers et al. (1986) Science 
232:613); by PCR mutagenesis (Leung et al. (1989) Method Cell Mol Biol 1:11-19); or by 
random mutagenesis (Miller et al. (1992) A Short Course in Bacterial Genetics, CSHL Press, 
Cold Spring Harbor, NY) can be used to create libraries of variants which can be further 
screened, even by simple receptor binding assays, for receptor binding activity. To further 
illustrate the state of the art, it is noted that the review article of Gallop et al. (1994) J Med 
Chem 37:1233 describe the general state of the art of combinatorial libraries. In particular, 
Gallop et al state at page 1239 n [s]creening the analog libraries aids in determining the 
minimum size of the active sequence and in identifying those residues critical for binding 
and intolerant of substitution". 

For the most part, the amino acids used in the subject receptor agonists and 
antagonists of this invention will be those naturally occurring amino acids found in proteins, 
or the naturally occurring anabolic or catabolic products of such amino acids which contain 
amino and carboxyl groups. Particularly suitable amino acid side chains include side chains 
selected from those of the following amino acids: glycine, alanine, valine, cysteine, leucine, 
isoleucine, serine, threonine, methionine, glutamic acid, aspartic acid, glutamine, asparagine, 
lysine, arginine, proline, histidine, phenylalanine, tyrosine, and tryptophan. 

However, the term amino acid residue further includes analogs, derivatives and 
congeners of any specific amino acid referred to herein. For example, the present invention 
contemplates the use of amino acid analogs wherein a side chain is lengthened or shortened 
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cyclization, as well as amino acid analogs having variant side chains with appropriate 
functional groups). For instance, the subject peptidomimetic can include an amino acid 
analog as for example, b-cyanoalanine, canavanine, djenkolic acid, norleucine, 3- 
phosphoserine, homoserine, dihydroxyphenylalanine, 5-hydroxytryptophan, 1 - 
methylhistidine, or 3-methylhistidine. Othfer naturally occurring amino acid metabolites or 
precursors having side chains which are suitable herein will be recognized by those skilled in 
the art and are included in the scope of the present invention. 

Also included are the D and L stereoisomers of such amino acids when the structure 
of the amino acid admits of stereoisomeric forms. The configuration of the amino acids and 
amino acid residues herein are designated by the appropriate symbols D, L or DL, 
furthermore when the configuration is not designated the amino acid or residue can have the 
configuration D, L or DL. It will be noted that the structure of some of the compounds of 
this invention includes asymmetric carbon atoms. It is to be understood accordingly that the 
isomers arising from such asymmetry are included within the scope of this invention. Such 
isomers are obtained in substantially pure form by classical separation techniques and by 
sterically controlled synthesis. For the purposes of this application, unless expressly noted to 
the contrary, a named amino acid shall be construed to include both the D or L stereoisomers, 
preferably the L stereoisomer. 

XVII. Pharmaceutical Preparations of Identified Agents 

After identifying certain test compounds as potential surrogate ligands, or receptor 
antagonists, the practioner of the subject assay will continue to test the efficacy and 
specificity of the selected compounds both in vitro and in vivo. Whether for subsequent in 
vivo testing, or for administration to an animal as an approved drug, agents identified in the 
subject assay can be formulated in pharmaceutical preparations for in vivo administration to 
an animal, preferably a human. - 

The compounds selected in the subject assay, or a pharmaceutically acceptable salt 
thereof, may accordingly be formulated for administration witff a biologically acceptable 
medium, such as water, buffered saline, pplyol (for example, glycerol, propylene glycol, 
liquid polyethylene glycol and the like) or suitable mixtures thereof. The optimum 
concentration of the active ingredient(s) in the chosen medium can be determined 
empirically, according to procedures well known to medicinal chemists. As used herein, 
"biologically acceptable medium" includes any and all solvents, dispersion media, and the 
like which may be appropriate for the desired route of administration of the pharmaceutical 
preparation. The use of such media for pharmaceutically active substances is known in the 
art. Except insofar as any conventional media or agent is incompatible with the activity of 
the compound, , its use in the pharmaceutical preparation of the invention is contemplated. 
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Suitable vehicles and their formulation inclusive of other proteins are described, for example, 
in the book Remington's Pharmaceutical Sciences (Remington's Pharmaceutical Sciences. 
Mack Publishing Company, Easton, Pa., USA 1985). These vehicles include injectable 
"deposit formulations". Based on the above, such pharmaceutical formulations include, 
although not exclusively, solutions or freeze-dried powders of the compound in association 
with one or more pharmaceutically acceptable vehicles or diluents, and contained in buffered 
media at a suitable pH and isosmotic with physiological fluids. In preferred embodiment, the 
compound can be disposed in a sterile preparation for topical and/or systemic administration. 
In the case of freeze-dried preparations, supporting excipients such as, but not exclusively, 
mannitol or glycine may be used and appropriate buffered solutions of the desired volume 
will be provided so as to obtain adequate isotonic buffered solutions of the desired pH. 
Similar solutions may also be used for the pharmaceutical compositions of compounds in 
isotonic solutions of the desired volume and include, but not exclusively, the use of buffered 
saline solutions with phosphate or citrate at suitable concentrations so as to obtain at all times 
isotonic pharmaceutical preparations of the desired pH, (for example, neutral pH). 

Exemplification 

The invention now being generally described will be more readily understood by 
reference to the following examples, which are included merely for purposes of illustration of 
certain aspects and embodiments of the present invention and are not intended to limit the 
invention. 

Example 1: Development of Autocrine Yeast Strains 

In this example, we describe a pilot experiment in which haploid cells were 
engineered to be responsive to their own pheromones. (Note that in the examples, functional 
genes are capitalized and inactivated genes are in lower case.) For this purpose we 
constructed recombinant DNA molecules designed to: 

i. place the coding region of STE2 under the transcriptional control of elements 
which normally direct theTranscriptknrof STE3. This is done in a plasmid that allows the 
replacement of genomic STE3 of S. cerevisiae with sequences wherein the coding sequence 
of STE2 is driven by STE3 transcriptional control elements. 

ii. place the coding region of STE3 under the transcriptional control of elements 
which normally direct the transcription of STE2. This is done in a plasmid which will allow 
the replacement of genomic STE2 of S. cerevisiae with sequences wherein the coding 
sequence of STE3 is driven by STE2 transcriptional control elements. 
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The sequence of the STE2 gene is known see Burkholder A.C. and Hartwell L.H. 
(1985), Nuc. Acids Res., 13, 8463; Nakayama N., Miyajima A., Arai K. (1985) EMBO J. 4, 
2643, 

A 4.3 kb BamHI fragment that contains the entire STE2 gene was excised from 
plasmid YEp24-STE2 (obtained from J. Thorner, Univ. of California) and cloned into 
pALTER (Protocols and Applications Guide, 1991, Promega Corporation, Madison, WI). 
An Spel site was introduced 7 nucleotides (nts) upstream of the ATG of STE2 with the 
following mutagenic oligonucleotide, using the STE2 minus strand as template: 

5-GTTAAGAACCATATACTAGTATCAAAAATGTCTG 3^ /W-s) 

A second Spel site was simultaneously introduced just downstream of the STE2 stop 
codon with the following mutagenic oligonucleotide: 

5 f -TGATCAAAATTTACJAGTTO yJ&tft&M^) 

The BamHI fragment of -the resulting plasmid (Cadus 1096) containing STE2 with 
Spel sites immediately flanking the coding region, was then subcloned into the yeast 
integrating vector YIpl9 to yield Cadus 1 143. 

The STE3 sequence is also known (Nakayama N., Miyajima A., Arai K. (1985), 
EMBO J. 4, 2643; (Hagen D.C., McCaffrey G., Sprague G.F. (1986), Proc. Natl Acad. Set 
83, 1418. STE3 was made available by Dr. J. Broach as a 3.1 kb fragment cloned into 
pBLUESCRIPT-KS II (Stratagene, 11011 North Torrey Pines Road, La Jolla, CA 92037). 
STE3 was subcloned as a KpnI-Xbal fragment into both M13mpl8 RF (to yield Cadus 1 105 
and pUC19 (to yield Cadus 1107). The two Spel sites in Cadus 1107 were removed by 
digestion with Spel, fill-in with DNA polymerase I Klenow fragment, and recircularization 
by blunt-end ligation. Single-stranded DNA containing the minus strand of STE3 was 
obtained using Cadus 1105 arid Spel sites were introduced 9 nts upstream of the start codon 
and 3 nts downstream of the stop codon of STE3 with the following mutagenic 
oligonucleotides, respectively: 

5 f -GGCAAAATACTAGTAAAATTTTCATGTC 3' ih ^ 

5'-GGCCCTTAACACACTAGTGTCGCATTATATTTAC 3'6o&fcW«) 

The mutagenesis was accomplished using the T7-GEN protocol of United States 
Biochemical (T7-GEN In Vitro Mutagenesis Kit, Descriptions and Protocols, 1991, United 
States Biochemical, P.O. Box 22400, Cleveland, Ohio 44122). The replicative form of the 
resulting Cadus 1141 was digested with Aflll and Kpnl, and the approximately 2 kb fragment 
containing the entire coding region of STE3 flanked by the two newly introduced Spe I sites 
was isolated and ligated with the approximately 3.7 kb vector fragment of Aflll- and Kpnl- 
digested Cadus 1107, to yield Cadus 1138. Cadus 1138 was then digested with Xbal and 
Kpnl, and the STE3-containing 2.8 kb fragment was ligated into the Xbal- and KpnI-digested 
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yeast integrating plasmid pRS406 (Sikorski, R.S. and Hieter, P. (1989) , Genetics 122:19-27) 
to yield Cadus 1 145. 

The Spel fragment of Cadus 1143 was replaced with the Spel fragment of Cadus 
1 145 to yield Cadus 1 147, in which the coding sequences of STE3 are under the control of 
STE2 expression elements. Similarly, the Spel fragment of Cadus 1145 was replaced with 
the Spel fragment of Cadus 1143 to yield Cadus 1148, in which the coding sequences of 
STE2 are under the control of STE3 expression elements. Using the method of pop-in/pop- 
out replacement (Rothstein, R. (1991) Methods in Enzymology, 194:281 301), Cadus 1147 
was used to replace genomic STE2 with the ste2-STE3 hybrid in a MATa cell and Cadus 
1 148 was used to replace genomic STE3 with the ste3-STE2 hybrid in a MATa cell. Cadus 
1 147 and 1 148 contain the selectable marker URA3. 

Haploid yeast of mating type a which had been engineered to express HIS3 under the 
control of the pheromone-inducible FUS1 promoter were transformed with CADUS 1147, 
and transformants expressing URA3 were selected. These transformants, which express both 
Ste2p and Ste3p, were plated on 5-fluoroorotic acid to allow the selection of clones which 
had lost the endogenous STE2, leaving in its place the heterologous, integrated STE3. Such 
cells exhibited the ability to grow on media deficient in histidine, indicating autocrine 
stimulation of the pheromone response pathway. 

Similarly, haploids of mating type a that can express HI S3 under the control of the 
pheromone-inducible FUS1 promoter were transformed with CADUS 1148 and selected for 
replacement of their endogenous STE3 with the integrated STE2. Such cells showed, by 
their ability to grow on histidine-deficient media, autocrine stimulation of the pheromone 
response pathway. 

Example?: Strain Development 

In this example, yeast strains are constructed which will facilitate selection of clones 
which exhibit autocrine activation of the pheromone response pathway. To construct 
appropriate yeast strains, we will use: the YIp-STE3 and pRS-STE2 knockout plasmids 
described above, plasmids available for the knockout of FAR1, SST2, and HIS3, and mutant 
strains that are commonly available in the research community. The following haploid 
strains will be constructed, using one-step or two-step knockout protocols described in Meth 
Enzymol 194:281-301, 1991: 

FUS1::HIS3 
FUS1::HIS3 

mfal mfa2 FUS1::HIS3 
mfal mfa2 FUS1::HIS3 



1. MATa ste3::STE2::ste3 farl sst2 

2. MATa ste2::STE3::ste2 farl sst2 

3. MATa ste3::STE2::ste3 farl sst2 

4. MATa ste2::STE3::ste2 farl sst2 
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5. MATa barl farUl fusl-HIS3 stel4::TRPl ura3 trpl leu2 his3 

6. MATa mfal mfa2 farl-1 his3::fusl-HIS3 ste2-STE3 ura3 metl 



adel leu2 



Strains 1 and 2 will be tested for their ability. to grow on histidine-deficient media as a 



result of autocrine stimulation of their pheromone response pathways by the pheromones 
which they secrete. If these tests prove successful, strain 1 will be modified to inactivate 
endogenous MFal and MFa2, The resulting strain 3, MATa far! sst2 ste3::STE2::ste3 
FUS1::HIS3 mfal mfa2 t should no longer display the selectable phenotype (i.e., the strain 
should be auxotrophic for histidine). Similarly, strain 2 will be modified to inactivate 
■endogenous MFal and MFa2. The resulting strain 4, MATa far I sst2 ste2::STE3::ste2 
FUS1::HIS3 mfal mfa2 t should be auxotrophic for histidine. The uses of strains 5 and 6 are - 
outlined in Examples 3 and 4 below. 

Example 3: Peptide Library 

In this example, a synthetic oligonucleotide encoding a peptide is expressed so that 
the peptide is secreted or transported into the periplasm. 

i. . The region of MFal which encodes mature cc-factor has been replaced via 
single-stranded mutagenesis with restriction sites that can accept oligonucleotides with AfHI 
and Bglll ends. Insertion of oligonucleotides with Aflll and Bglll ends will yield plasmids 
which encode proteins containing the MFal signal and leader sequences upstream of the 
sequence encoded by the oligonucleotides. The MFal signal and leader sequences should 
direct the processing of these precursor proteins through the pathway normally used for the 
transport of mature a-factor. 

The MFal gene, obtained as a 1.8 kb EcoRI fragment from pDA6300 (J. Thorner, 
Univ. of California) was cloned into pALTER in preparation for oligonucleotide-dirfccted 
mutagenesis to remove the coding region of mature a-factor while constructing sites for 
acceptance of oligonucleotides with Aflll and Bell ends. The mutagenesis was accomplished 
using the minus strand as template and the following mutagenic oligonucleotide: 



S'-CTAAAGAAGAAGGGGTATCTTTGCTTAAGCTCGAGATCTCGACTGATA- 



A Hindlll site was simultaneously introduced 7 nts upstream of the MFal start codon 
with the oligonucleotide: 



5' CATACACAATATAAAGCTTTAAAAGAATGAG-3' & ^'°) 

The resulting plasmid, Cadus 1214, contains a Hindlll site 7 nts upstream of the MFa 
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1 initiation codon, an Aflll site at the positions which encode the KEX2 processing site in the 
MFal leader peptide, and Xhol and Bglll sites in place of all sequences from the leader- 
encoding sequences up to and including the normal stop codon. The 1.5 kb Hindlll fragment 
of Cadus 1214 therefore provides a cloning site for oligonucleotides to be expressed in yeast 
and secreted through the pathway normally travelled by endogenous a-factor. 

A sequence comprising the ADC1 promoter and 5' flanking sequence was obtained as 
a 1.5 kb BamHI-Hindlll fragment from pAAH5 (Ammerer, G. (1983) Academic Press, Inc., 
Meth. Enzymol. 101, 192-201 and ligated into the high copy yeast plasmid pRS426 
(Christianson, T.W et al. (1992) Gene 1 10:1 19-122) (see Figure 1). The unique Xhol site in 
the resulting plasmid was eliminated to yield Cadus 1186. The 1.5 Kb Hindlll fragment of 
Cadus 1214 was inserted into HindlH-digested Cadus 1 1 86; expression of sequences cloned 
into this cassette initiates from the ADH1 promoter. The resulting plasmid, designated Cadus 
1215, can be prepared to accept oligonucleotides with Aflll and Bell ends by digestion with 
those restriction endonucleases. The oligonucleotides will be expressed in the context of MF 
al signal and leader peptides (Figure 2). 

Modified versions of Cadus 1215 were also constructed. To 30 improve the 
efficiency of ligation of oligonucleotides into the expression vector, Cadus 1215 was 
restricted with Kpnl and religated to yield Cadus 1337. This resulted in removalof one of 
two Hindlll sites. Cadus 1337 was linearized with Hindlll, filled-in, and recircularized to 
generate Cadus 1338. To further tailor the vector for library construction, the following 
double-stranded oligonucleotide was cloned into Aflll-and Bglll-digested Cadus 1338: 

£ 5' TTAAGCGTGAGGCAGAAGCTTATCGATA oligo 062^0^ & ^ 'j) 

6 3' CGCACTCCGTCTTCGAATAGCTATCTAG oligo 063 <b *° ' It) 

The Clal site is unique in the resulting vector, Cadus 1373. In Cadus 1373, the 
Hindlll site that exists at the junction between the MFa pro sequence and the mature peptide 
to be expressed by this vector was made unique. Therefore the Hindlll site and the 
downstream Bglll site can be used to insert oligonucleotides encoding peptides of interest 
These modifications of Cadus 1215 provide an laternative to the use of the Aflll site in the 
. cloning of oligonucleotides into the expressions vector. 

Cadus 1373 was altered further to permit elimination from restricted vector 
preparations of contaminating singly-cut plasmid. Such contamination could result in 
unacceptably high background transformation. To eliminate this possibility, approximately 
1 .1 kb of dispensable ADH1 sequence at the 5 f side of the promoter region was deleted. This 
was accomplished by restruction of Cadus 1373 with SphI and BamHI, fill-in, and ligation; 
this maneuver regenerates the BamHI site. The resulting vector, Cadus 1624, was then 
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restricted with Hindlll and Clal and an approximately 1.4 kb HindlH and Clal fragment 
encoding 25 lacZ was inserted to generate Cadus 1625. Use of HindlH- and Bglll-restricted 
Cadus 1625 for acceptance of oligonucleotides results in a low background upon 
transformation of the ligation product into bacteria. 

Two single-stranded oligonucleotide sequences (see below) are synthesized, 
annealed, and repetitively filled in, denatured, and reannealed to form double-stranded 
oligonucleotides that, when digested with Afill and Bell, can be ligated into the polylinker of 
the expression vector, Cadus 1215. The two single-stranded oligonucleotides have the 
following sequences: . ^ v 

5-G CTA CTT AAG CGT GAG GCA GAA GCT 3'* and /> v A 

5'-C GGA TGA TCA (NNN) n AGC TTC TGC CTC ACG CTT AAG TAG C 3^ 

where N is any chosen nucleotide and n is any chosen integer. Yeast transformed with the 
resulting plasmids will secrete — through the a-factor secretory pathway — peptides whose 
amino acid sequence is determined by the particular choice of N and n). 

Alternatively, the following single stranded oligonucleotides are used: 
MFaNNK (76 mer) : 



5'CTGGAIGCGAAGACAGCTNNKNNKNN^ 

NNK TGATCAGTCTGTGACGC 3' *° l{ V 

a. 

and MFaMbo (17 mer) : 

5' GCGTCACAGACTGATCA 3' (&Q^K>'- 

When annealed the double stranded region is: 

TGA 7XT AGTCTGTG ACGC^ (f(3X fh H>'*(T). 

AC TA (TTC AG AC ACTGCG A \t> • 1^) 

After fill-in using Taq DNA polymerase (Promega Corporation, Madison, 
Wisconsin), the double stranded product is restricted with Bbsl and Mbol and ligated to 
HindlH- and Bglll-restricted Cadus 1 373 . 

ii. The region of MFal which encodes mature a-factor will be replaced via single 
stranded mutagenesis with restriction sites that can accept oligonucleotides with Xhol and 
Aflll ends. Insertion of oligonucleotides with Xhol and Aflll ends will yield plasmids which 
encode proteins containing the MFal leader sequences upstream of the sequence encoded by 
the oligonucleotides. The MFal leader sequences should direct the processing of these 
precursor proteins through the pathway normally used for the transport of mature a-factor. 
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MFA1 obtained as a BamHI fragment from pKK! (prided by 1. 30 Thorner and K. 
Zer rttedin^eBan^sUeofpALTCR^ega). Ustag *= n^us stiand of 
^template, a HindHI site was inserted by oligonucleotide-directed mutagenes.s ^ust 5 
to the MFA1 start codon using the following oligonucleotide: Q^^^j 
A 5' CCAAAATAAGTACAAAGCTTTCGAATAGAAATGCAACCATC^ 

A second oligonucleotide was used simultaneously to introduce a short polymer for 
iater cloning of synthetic oligonucleotides in place of MFA1 sequent. These MFA1 

to the stop codon: - 
0 5'GCCGCTCCAAAAGAAAAGACCTCGAGCTCGCTTAAGTTCTGCGTACAAAAACG- 

* TTGTTC3' Cfett^M^ 

The 1 6 kb Hindlll fragment of the resulting plasmid, Cadus '172 — 
sequences ending the MFA1 start codon and the N-termina. 16 amino acids of the leade 
ZH, lowed by a short poiylinKer containing Xhol, Sac., and Af.ll sites for .nserfon of 
■ S™Ldes. Tbe 1.6 kb Hindlll fragment of Cadus U72 was hgated rnto H-flD- 
digested Cadus 1186 (see above) to place expression of sequences cloned tnto tins cassette 

control of the ADW promoter. The Sac, site in the polylinker « • 
eliminating a second Sad site present in the vector. The resulting plasmrd, ^ ^ Cadu, 
2 7L be prepared to accept oligonucleotides with Xhol and Aflll ends by dtgestion wtth 
Z'r^tion endonucleases for expression in the context of MFal leader peptides (Frgure 

Two single-stranded ohgonucleotide sequences (see below) are synthesized annealed *>d 
Z tZy filled in, denatured, and reannealed to form double-strande, ohgonucleotides 
Z Z « i- with Aflll and Bglll, can be cloned into the polylmker o me express,™ 
"d» -.239. Thetwosingle-strandedoiigonucleotidesusedformeclomnghaveme 

following sequences: ^^2,/) 
5- GG TAC TCG AGT GAA AAG AAG GAC AAC -J 
y CG TAC TTA AGC^AAT AAC ACA (NNN) „ GTT GTC CTT CTT TTC ACT CGA 

J GTACCS^*^^ - 

where N is any Chosen nucleotide and n is any chosen integer. 

Yeast transformed with the resulting plasmids will .a.spon -through ^e pa^way norn^y 
leTforTe export of a-factor - famesylated, carboxymethylated pephdes whose ammo add 
sequence is determined by the particular choice of N and n (Figure 3). 
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Example 4. Peptide Secretion/Transport. 
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This example demonstrates the ability to engineer yeast such that they secrete or 
transport oligonucleotide-encoded peptides (in this case their pheromones) through the 
pathways normally used for the secretion or transport of endogenous pheromones. 

Autocrine MATa strain CY588: 

A MATa strain designed for the expression of peptides in the context of MFal (i.e., 
using the MFal expression vector, Cadus 1215) has been constructed. The genotype of this 
strain, which we designate CY588, is MATa barl farUl fusl-HIS3 stel4::TRPl ura3 
trpl leu2 his3. The barl mutation eliminates the strain's ability to produce a protease that 
degrades a-factor and that may degrade some peptides encoded by the cloned 
oligonucleotides; the fdrl mutation abrogates the arrest of growth which normally follows 
stimulation of the pheromone response pathway; an integrated FUS1-HIS3 hybrid gene 
provides a selectable signal of activation of the pheromone response pathway; and, finally, 
the stel4 mutation lowers background of the FUS1-HIS3 readout. The enzymes responsible 
for processing of the MFal precursor in MATa cells are also expressed in MATa cells 
(Sprague and Thomer, in The Molecular and Cellular Biology of the Yeast Saccharomyces: 
Gene Expression, 1992, Cold Spring Harbor Press), therefore, CY588 cells should be able to 
secrete peptides encoded by oligonucleotides expressed from plasmid Cadus 1215. 

A high transforming version (tbt\-\) of CY588 was obtained by crossing CY1013 
(CY588 containing an episomal copy of the STE14 gene) (MATa barl::hisGfarl-l fusl-HIS3 
stel4::TRPl ura3 trpl leu2 his3 [STE14 URA3 CEN4) to CY793 (MATa- tbtl-1 ura3 leu2 
trpl his3 fusl-HIS2 canl stell4::TRPl [FUS1 LEU2 2\i]) and selecting from the resultant 
spores a strain possessing the same salient genotype described for CY588 (see above), and in 
addition the tbl-1 allele, which confers the capacity for very high efficiency transformation 
by electroporation. The selected strain is CY1455 (MATabarl;:hisGfarl-l fusl-HIS3 
stel4::TRPl tbt-1 ura3trplleu2his3). 

Secretion of peptides in the context of yeast a-factor: 

Experiments were performed to test: 1. the ability of Cadus 1215 to function as a 
vector for the expression of peptides encoded by synthetic oligonucleotides; 2. the suitability 
-of the oligonucleotides, as designed, to direct the secretion of peptides through the a-factor 
secretory pathway; 3. the capacity of CY588 to secrete those peptides; and 4. the ability of 
CY588 to respond to those peptides that stimulate the pheromone response pathway by 
growing on selective media. These experiments were performed using an oligonucleotide 
which encodes the 13 amino acid a-factor; i.e., the degenerate sequence (NNN) n in the 
oligonucleotide cloned into Cadus 1215 (see above) was specified (n=13) to encode this 
pheromone. CY588 was transformed with the resulting plasmid (Cadus 1219), and 
transformants selected on uracil-deficient medium were transferred to histidine-deficient 
medium supplemented with a range of concentrations of aminotriazole (an inhibitor of the 
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HIS3 gene product that serves to reduce background growth). The results demonstrate that 
the synthetic oligo-nucleotide, expressed in the context of MFal by Cadus 1215, conferred 
upon CY588 an ability to grow on histidine-deficient media supplemented with 
aminotriazole. In summation, these data indicate that: 1. CY588 is competent for the 
secretion of a peptide encoded by the (NNN) n sequence of the synthetic oligonucleotide 
cloned into and expressed from Cadus 1215; and 2. CY588 can, in an autocrine fashion, 
respond to a secreted peptide which stimulates its pheromone response pathway, in this case 
by a-factor binding to STE2. 

Additional experiments were performed to test the utility of autocrine yeast strains in 
identifying agonists of the Ste2 receptor from among members of two semi-random a-factor 
libraries, a-Mid-5 and MFcc-8. 

a-Mid-5 Library 

A library of semi-random peptides, termed the a-Mid-5 library, was constructed. In 
this library, the N-terminal four amino acids and the C-terminal four amino acids of a 13 
residue peptide are identical to those of native a-factor while the central five residues 
(residues 5-9) are encoded by the degenerate sequence (NNQ) 5 . The following 
oligonucleotides were used in the construction of the a-Mid-5 library: 

(1) MFaMbo, a 17 mer: 

5' GCGTC AC AG ACTG ATC A A ^ : ^ 

(2) MID5ALF, a 71 mer: 
5' 

GCCGTCAGTAMGC1TGGCATTGGTTGNNQNNQNNQNNQMMQCAGCCTATGTA 
CTGATC AGTCTGTGACGC _ C$&& fow>'-^ 

Sequenase (United States Biochemical Corporation, Cleveland, Ohio) was used to 
complete the duplex formed after annealing MFaMbo to the MID5ALF oligonucleotide. In 
the MID5ALF sequence, N indicates a mixture of A, C, G, and T at ratios of 0.8:1:1.3:1; Q 
indicates a mixture of C and G at a ratio of 1:1.3. These ratios were employed to compensate 
for the different coupling efficiences of the bases during oligonucleotide synthesis and were 
thus intended to normalize the appearance of all bases in the library. The double-stranded 
oligonucleotide was restricted with Hindlll and Mbol and ligated to Cadus 1625 (see above); 
Cadus 1625 had been prepared to accept the semi-random oligonucleotides by restriction 
with Hindlll and Bglll. 

The apparent complexity of the aMid-5 library is 1 x 1 0 7 . This complexity is based 
on the number of bacterial transformants obtained with the library DNA versus transformants 
obtained with control vector DNA that lacks insert. Sequence analysis of six clones from the 
library demonstrated that each contained a unique insert. 
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To identify peptide members of the a-mid-5 library that could act as agonists on the 
STE2 receptor, CY1455, a high transforming version of CY588, was electroporated to 
enhance uptake of a-Mid-5 DNA. Transformants were selected on uracil-deficient (-Ura) 
synthetic complete medium ' and were transferred to histidine-deficient (-His) synthetic 
complete medium supplemented with 0.5mM or ImM aminotriazole. 

Yeast able to grow on -His + aminotriazole medium include (1) yeast which are 
dependent on the expression of an a-factor variant agonist and (2) yeast which contain 
mutations that result in constitutive signalling along the pheromone pathway. Yeast 
expressing and secreting a variant STE2 receptor agonist have the ability to stimulate the 
growth on -His medium of surrounding CY 1455 cells that do not express such an agonist. 
Thus a recognizable formation (termed a "starburst") results, consisting of a central colony, 
growing by virtue of autocrine stimuation of the pheromone pathway, surrounded by satellite 
colonies, growing by virtue of paracrine stimulation of the pheromone pathway by the 
agonist peptide as that peptide diffuses radially from the central, secreting colony. 

In order to identify the peptide sequence responsible for this "starburst" phenomenon, 
yeast were transferred from the center of the "starburst" and streaks were made on -Ura 
medium to obtain single colonies. Individual clones from -Ura were tested for the His+ 
phenotype on -His + aminotriazole plates containing a sparse lawn of CY1455 cells. 
Autocrine yeast expressing a peptide agonist exhibited the "starburst 11 phenotype as the 
secreted agonist stimulated the growth of surrounding cells that lacked the peptide but were 
capable of responding to it. Constitutive pheromone pathway mutants were capable of 
growth on -His + aminotriazole but were incapable of enabling the growth of surrounding 
lawn cells. 

Alternatively, streaks of candidate autocrine yeast clones were made on plates 
containing 5-fluoroorotic acid (FOA) to obtain Ura segregants were retested on -His .+ 
aminotriazole for the loss of the His+ phenotype. Clones that lost the ability to grow on -His 
+ aminotriazole after selection on FOA (and loss of the peptide-encoding plasmid) derived 
from candidate expressors of a peptide agonist. The plasmid was rescued from candidate 
_ clones and the peptide sequences determined. In addition, a plasmid encoding a putative 
Ste2 agonist was reintroduced into CY1455 to confirm that the presence of the plasmid 
encoding the peptide agonist conferred the His+ phenotype to CY1455. 

By following the above protocol novel Ste2 agonists have been identified from the a-Mid-5 
library. Sequences of nine agonists follow, preceded by the sequence fo the native a-factor 
pheromone and by the oligonucleotide used to encode the native pheromone in these 
experiments. (Note the variant codons used in the a-factor-encoding oligonucleotide for 
glutamine and proline in the C-terminal amino acids of a-factor). 

a-factor TGG CAT TGG TTG CAG CTA AAA CCT GGC CAA CCA ATG TAC ' k : 2^) 
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encodes Trp His Trp Leu Gin Leu Lys Pro Gly Gin Pro Met Tyr &**> r 
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The nine peptide agonists of the Ste2 receptor above were derived from one 
electroporation of CY1455 using 1 ng of the a-Mid-5 library DNA. Approximately 3 x 10 5 
transformants were obtained, representing approximately .03% of the sequences present in 
that library. 



MFa-8 Library 

A semi-random a-factor library was obtained through synthesis of mutagenized ce- 
faclor oligonucleotides such that 1 in 10,000 peptide products were expected to be genuine a 
-factor. The mutagenesis was accomplished with doped synthesis of the oligonucleotides: 
each nucleotide was made approximately 68% accurate by synthesizing the following two 
oligos: 
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5' 



CTGG ATGCG AAG ACTC AGCT (20 mer) (oligo060) *7) 



5' CGG ATGATCA gta cat tgg ttg gcc agg ttt tag ctg caa cca atg cca AGC TGA GTC 
TTC GCATCC AG (69 mer) (oligo074) ^6*28 l*> WW 

The lower case letters indicate a mixture of 67% of that nucleotide and 1 1% of each of the 
other three nucleotides (e.g. t indicates 67% T and 11% A, 1 1% C, and 11% G). Note that 
digestion of the double-stranded oligonucleotide by Fokl or Bbsl will yield an identical 5' 
end that is compatible with Hindlll ends. 

Oligos 060 and 074 will form the following double-stranded molecule when annealed: 
5'-C TGGATG CG AAGACTCAGCT^^ < bwj) 

3 '-GACCTACGCTTCTG AGTCGA acc gta acc aac gtc gat ttt gga ccg gtt ggt tac atg 
ACTAGTAGGC-5 f C£eO>&>W$ 



The duplex was repetitively filled-in using Taq DNA polymerase (Promega 
Corporation, Madison, Wisconsin). The double-stranded product was restricted with Bbsl 
and Bell and ligated into Hindlll- and Bglll-digested Cadus 1373. The Bglll/Bcll joint 
creates a TGA stop codon for the termination of translation of the randomers. Using this 
approach,, the MFa-5.8 library (a library of apparent low complexity based on PCR analysis 
of oligonucleotide insert frequency) was constructed. 

To identify peptide members of the MFa-5.8 library that could act as agonists on the 
STE2 receptor, CY1455, a high transforming version of CY588, was electroporated to 
enhance uptake of MFa-5.8 DNA. Transformants were selected on uracil-deficient (-Ura) 
synthetic complete medium and were transferred to histidine-deficient (-His) synthetic 
complete medium supplemented with 1.0 mM or 2.5 mM aminotriazole. Yeast from 
colonies which were surrounded by satellite growth were transferred as streaks to -Ura 
medium to obtain single colonies. Yeast from single colonies wree then tested for the His+ 
phenotype on -His + aminotriazole plates. Sequence analysis of seven of the plasmids 
rescued from His+ yeast revealed three unique a-factor variants that acted as agonists on the 
STE2 receptor. 

\ A independent clones had the following sequence: (^33a^ U>: Sj) 

TGG CAT TGG CTA CAG CTA ACG CCT GGG CAA CCA ATG TAC 
encoding Trp His Trp Leu Gin Leu Thr Pro Gly Gin Pro Met Tyr^ 
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2.2 independent clones had the following sequence: ^ ' 

TGG CAT TGG CTG GAG CTT ATG CCT GGC CAA CCA TTA TAC 



encoding Trp His Trp Leu Glu Leu Met Pro Gly Gin Pro Leu Tyr 



3. TGG CAT TGG ATG GAG CTA AGA CCT GGC CAA CCA ATG TAC A 

encoding Trp His Trp Met Glu Leu Arq Pro Gly Gin Pro Met Tyr. t***** 1 ** 0 ^ 



Autocrine Mata strain CY5 99: 

A MATa strain designed for the expression of peptides in the context of MFA1 (i.e., 
using the MFA1 expression vector, Cadus. 1239) has been constructed. The genotype of this 
strain, designated CY599, is MATa mfal mfa2 farl-1 his3::fusl-HIS3 ste2-STE3 ura3 
metl adel leu2. In this strain, Cadus 1147 (see above) was used to replace STE2 with a 
hybrid gene in which the STE3 coding region is under the control of expression elements 
which normally drive the expression of STE2. As a result, the a-factor receptor replaces the 
a-factor receptor. The genes which encode a-factor are deleted from this strain; the farl 
mutation abrogates the arrest of growth which normally follows stimulation of the 
pheromone response pathway; and the FUS1-HIS3 hybrid gene (integrated at the HIS3 locus) 
provides a selectable signal of activation of the pheromone response pathway. CY599 cells 
were expected to be capable of the transport of a-factor or a-factor-like peptides encoded by 
oligonucleotides expressed from Cadus 1239 by virtue of expression of the endogenous yeast 
transporter, Ste6. 

Transport of peptides by the yeast a-factor pathway: 

Experiments were performed to test: 1. the ability of Cadus 1239 to function as a 
vector for the expression of peptides encoded by synthetic oligonucleotides; 2. the suitability 
of the oligonucleotides, as designed, to direct the export of farnesylated, carboxymethylated 
peptides through the pathway normally used by a-factor; 3. the capacity of CY599 to export 
these peptides; and 4. the ability of CY599 to respond to those peptides that stimulate the 
pheromone response pathway by growing on selective media. These tests were performed 
using an oligonucleotido_:which encodes the 12 amino acid a-factor; specifically, the 
degenerate sequence (NNN) n -in the oligonucleotide cloned into Cadus 1239 (see above) 
(with n=12) encodes the peptide component of a-factor pheromone. CY599 was transformed 
with the resulting plasmid (Cadus 1220), and transformants selected on uracil-deficient 
medium were transferred to histidine-deficient medium supplemented with a range of 
concentrations of aminotriazole. The results demonstrate that the synthetic oligonucleotide, 
expressed in the context of MFA1 by Cadus 1220, conferred upon CY599 enhanced 



-76 



ln summation, these data 

used to discover peptides xn*y transformed wvtn t,/^ 

• * Pxamole 2 above) will be u<u , _ of functional a 

-factor analogs- CYS99 I se_ ^ ^ of f^uon ^ 

B»«*»*«* ,4, ri k - «ow on histidine.de.. «- - 

the oligo- 

transformation will be exp be sequenC ed to d 

aeI »onstrate the potent of * J response pathway. 

designed such tot- the AflU an ^ ml srte ,„ the 5 en GAGA 

fature flexibility w.<h^8 ^ „^ a»4 ^ ^ ^ * ^ a^bed 
. rep cats which are presen , »^^ >0 acid , 

changed without altenng theenc W ^ V .^ 

abov e witl actually be constructed * QT0AGQC AGAAGCT agd 
CGTGAAGCTTAAGCGTGAUU ^^Aj^) 

*" y CGGATGATCAtMmuAGCnCTG, 



-77- 



where M is either A or C at a ratio of 40:60. The oligos will be annealed with one another 
and repetitively filled in, denatured, and reannealed (Kay et al, Gene, 1993). The double- 
stranded product will be cut with AflH and Bell and ligated into the Aflll- and Bglll-digested 
CADUS 1215. The Bglll/Bcll joint will create a TGA stop codon for termination of 
translation of the randomers. Because of the TA content of the Afl overhang, the oligos will 
be ligated to the Aflll-and Bglll-digested pADC-MFa at 4° C. 

Random oligonucleotides to be expressed by the expression plasmid CADUS 1239 
will encode monodecapeptides constructed as g&xfoto'Lo) 
5' GGTACT£GAGTGAAAAGAAGGACAAC(NNK) M ^ 

where N is any nucleotide, K is either T or G at a ratio of 40:60 (see Proc. Natl. Acad, set 
87:6378, 1990; ibid 89:5393, 1992). When cloned into the Xhol and Aflll sites of CADUS 
1239 the propeptides expressed under the control of the ADH1 promoter will contain the 
entire leader peptide of MFal, followed by 11 random amino acids, followed by triplets 
encoding CVIA (the C-terminal tetrapeptide of wild-type a-factor). Processing of the 
propeptide should result in the secretion of dodecapeptides which contain 1 1 random amino 
acids followed by a C-terminal, farnesylated, carboxymethylated cysteine. 

Using the procedure described above, the oligonucleotides for expression in CADUS 

1 239 will actually be constructed from the following two oligos; 

C^fo M>-<*0 

5' GGTACTCGAGTGAAAAGAAGGACAAC and . 
5' CGTACTTAAGCAATAACAca(MNN) H GTTGTCC, 

where M is either A or C at a ratio of 40:60, and the Xhol and Aflll sites are 
underlined. 

Discovery of ' a-factor analoques from a random peptide library 

An optimized version of strain 6 (Example 2 above) was derived. This yeast strain, 
CY2012 (MATa ste2-STE3farlM442 mfal::LEU2 mfa2-lacZfusl-HIS3 tbtl-1 ura3 leu2 his3 
trpl suc2), was constructed as follows. From a cross of CY570 (MATa mfal::LEU2 mfa.2- 
lacZ ura3 trpl his3*200 canl leu2 fusl-HIS3 [MFA1 URA3 2u] [FuslA8-73 TRP1 CEN6J) by 
CY1624 (MATa tbtl-1 fusl-HIS3 trpl ura3 leu2 his3 lys2-801 SUC+), a spore was selected 
(CY1877) of the following genotype: MATa mfal::LEU2 mfa2-lacZ fusl-HIS3 tbtl-1 ura3 
leu2 his3 trpl suc2. This strain lacks both genes (NFA1 and MFA2) encoding a-factor 
precursors, contains the appropriate pheromone pathway reporter gene (fusl-HIS3), and 
transforms by electroporation at high efficiency (tbtl-l). This strain was altered by deletion 
of the FAR1 gene (with Cadus 1442; see Example 6), and replacement of STE2 coding 
sequences with that of STE3 (see Example 1) to yield CY2012. 
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This strain was transformed with plasmid DNA from a random a-factor library by 
electroporation and plated on 17 synthetic complete plates lacking uracil (-Ura), yielding 
approximately 10 5 Ura+colonies per plate after 2 days at 30°C. These colonies were replica 
plated to histidine-deficient synthetic complete media (-His) containing 0.2 mM 3- 
aminotriazole and after three days at 30°C 35 His+ replicas were streaked to -Ura plates. The 
resultant colonies, 3 from each isolate, were retested for their His+ phenotype, and streaked 
to 5-fluoroorotic acid plates to obtain Ura segregants (lacking a library plasmid). Those Ura- . 
segregants were tested for the loss of their His+ phenotype. Ten of the original isolates 
passed these tests; in two cases only one of the three Ura+ colonies purified from the isolate 
retained the His+ phenotype, but nevertheless subsequently segregated Ura His- colonies. 

A single plasmid (corresponding to a bacterial colony) was obtained from each of the 
ten isolates, and reintroduced into CY2012. Eight of the ten plasmids passed the test of 
retaining the ability to confer the His-f phenotype on CY2012 (the two that failed correspond 
to the two isolates that were mentioned above, suggesting that these isolates contain at least 
one "irrelevant' plasmid). Sequencing of the randomized insert in the eight plasmids of 
interest revealed that four contain the sequence: v 

TAT GCT CTG TTT GTT CAT TTT TTT GAT ATT CCG 
Tyr Ala Leu Phe Val His Phe Phe Asp lie Pro A 

two contain the sequence: fagQub/W 
TTT AAG GGT* CAG GTG CGT TTT GTG GTT CTT GCT % 

Phe Lys Gly Gin Val Arg Phe Val Val Leu Ala, \ 

A 

and two contain the sequence: ■' _ \ 

CTT ATG TCT CCG TCT TTT TTT TTT TTG CCT GCC^ 
Leu Met Ser Pro Ser Phe Phe Phe Leu Pro Alc^ 

Clearly, these .sequences encode novel peptides, as the native a-factor sequence differs 
considerably: ' _ s \ 

} Tyr lie lie Lys Gly ValPheTrp Asp Pro Ala. 

The a-factor- variants identified from random peptide libraries have utility as 
"improved" substrates of ABC transporters expressed in yeast. For example, identification of 
a preferred substrate of human MDR, one that retains agonist activity on the pheromone 
receptor, would permit the establishment of robust yeast screens to be used in the discovery 
of compounds that affect transporter function. 



Example 6: Functional Expression of a Mammalian G Protein-Coupled Receptor and 
Ligand in an Autocrine Yeast Strain. 
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This example details the following: (1) expression of human C5a receptor in yeast; 
(2) expression of the native Hgand of this receptor, human C5a, in yeast; and (3) activation of 
the endogenous yeast pheromone pathway upon stimulation of the C5a receptor by C5a when 
both of these molecules are expressed within the same strain of autocrine yeast. Following 
the experimental data we outline the utility of autocrine strains of yeast that functionally 
express the human C5a receptor. 

Human C5a is a 74 amino acid polypeptide that derives from the fifth component of 
complement during activation of the complement cascade; it is the most potent of the 
complement-derived anaphylatoxins. C5a is a powerful activator of neutrophils and 
macrophage functions including production of cytotoxic super oxide radicals and induction 
of chemotaxis and adhesiveness. In addition C5a stimulates smooth muscle contraction, 
induces degranulation of mast cells, induces serotonin release from platelets and increases 
vascular permeability. The C5a anaphylatoxin can also amplify the inflammatory response 
by stimulating the production of cytokines. As C5a is a highly potent inflammatory agent, it 
is a primary target for the development of antagonists to be used for intervention in a variety 
of inflammatory processes. 

The C5a receptor is present on neutrophils, mast cells, macrophages and smooth 
muscle cells and couples through G proteins to transmit signals initiated through the binding 
of C5a. 

Expression of the C5a Receptor 

The plasmid pCDM8-C5aRc, bearing cDN A sequence encoding the human C5a 
receptor, was obtained from N. Gerard and C. Gerard (Harvard Medical School, Boston, MA) 
(Gerard and Gerard 1991). Sequence encoding C5a was derived from this plasmid by PCR 
using VENT polymerase (New England Biolabs Inc., Beverly MA), and the following 
primers: 

#1-GGTGGGAGGGTGCTCTCTAGAAGGAAGTGTTCACC A 6^ ih^Ol ?£>) 
#2-GCCCAGGAGACCAGACCAIGGACT^ 

Primer #1 contains a single base-pair mismatch (underlined) to C5a receptor cDNA. It 
introduces an Xbal site (in bold) 201 bp downstream from the TAG termination codon of the 
C5a receptor coding sequence. Primer #2 contains two mismatched bases and serves to 
create an Ncol site (in bold) surrounding the ATG initiator codon (double underlined). The 
second amino acid is changed from an aspartic acid to an asparagine residue. This is the only 
change in primary amino acid sequence from the wild type human C5a receptor. 

The PCR product was restricted with Ncol and Xbal (sites in bold) and cloned into 



CADUS 1002 (YEpSlNco), a Gal 10 promoter expression vector. The sequence of the entire 
insert was determined by dideoxy sequencing using multiple primers. The sequence between 
the Ncol and Xbal sites was found to be identical to the human C5a receptor sequence that 
was deposited in GenBank (accession #J05327) with the exception of those changes encoded 
by the PCR primers. The C5a receptor-encoding insert was transferred to CADUS 1289 
(pLPXt), a PGK promoter expression vector, using the Ncol and Xbal sites, to generate the 
C5a receptor yeast expression clone, CADUS 1303. 

A version of the C5a receptor which contains a yeast invertase signal sequence and a 
myc epitope tag at its amino terminus was expressed in Cadus 1270-transferred yeast under 
control of a GAL 10 promoter. Plasmids encoding an untagged version of the C5a receptor 
and a myc-tagged derivative of FUS1 served as controls. The expression of the tagged 
receptor in yeast was confirmed by Western blot using the anti-myc monoclonal antibody 
9E10. In the lane containing the extract from the Cadus 1270-transformant, the protein that 
is reactive with the anti-myc monoclonal antibody 9E10 was approximately 40 kD in size, as 
expected. Note that this receptor construct is not identical to the one used in the autocrine 
activation experiments. That receptor is not tagged, does not contain a signal sequence and is 
driven by the PGK promoter. 

Expression of the Ligand, C5a 

A synthetic construct of the sequence encoding C5a was obtained from C. Gerard 
.(Harvard Medical School, Boston, MA). This synthetic gene had been designed as a FLAG- 
tagged molecule for the secretion from E. coli (Gerard and Gerard (1990) Biochemistry 
29:9274-9281). The C5a coding region, still containing E. coli codon bias, was amplified 
using VENT polymerase (New England Biolabs Inc., Beverly MA) through 30 cycles using 
the following primers: 

C5a5' = CCCCTTAAGCGTGAGGCAGAAGCTACTCTGCAAAAGAAGATC Cf^** ' 
C5a3' « GAAGATCTTC AGCGGCCGAGTTGGATGTC Cf/Ztk /h^/V»7* ) 

A PCR product of 257 bp was gel isolated, restricted with Ailll and Bglll, and cloned 
into CADUS 1215 (an expression vector designed to express peptide sequences in the context 
of Mfa) to yield CADUS 1297. The regions of homology to the synthetic C5a gene are 
underlined. The 5' primer also contains pre-pro -a-factor sequence.. Upon translation and 
processing of the pre-pro a-factor sequence, authentic human C5a should be secreted by 
yeast containing CADUS 1297. The insert sequence in CADUS 1297 was sequenced in both 
orientations by the dideoxy method and found to be identical to that predicted by the PCR 
primers and the published sequence of the synthetic C5a gene (Franke et al. (1988) Methods 
in Enzymology 162: 653-668). 
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Two sets of experiments, aside from the autocrine activation of yeast detailed below, 
demonstrated that CADUS 1297 can be used to express C5a in yeast. 

1) . C5a was immunologically detected in both culture supernatant and lysed cells 
using a commercially available enzyme-linked immunosorbent assay (ELISA)(Table 1). 
This assay indicated the concentration of C5a in the culture supernatant to be approximately 
50 to 100 nM. In comparison, in data derived from mammalian cells, the binding constant of 
C5a to its receptor is'l nM (Boulay et al.(1991) Biochemistry 30:2993-2999. 

2) . C5a expressed in yeast was shown to compete for binding with commercially 
obtained (Amersham Corporation, Arlington Heights, IL), radiolabeled C5a on induced 
HL60 cells. 

Activation of the Pheromone Response Pathway in Autocrine Yeast Expressing the Human 
C5a Receptor and Human C5a 

Activation of the yeast pheromone response pathway through the interaction of C5a 
with the C5a receptor was demonstrated using a growth read-out. The strain used for this 
analysis, CY455 (MATa tbtl-1 ura3 leu2.trpl his3 fusl-HIS3 canl stel4::TRPl ste3*1156) 
contains the following significant modifications. A pheromone inducible HIS3 gene , fusl- 
HIS3, is integrated at the Fusl locus. A hybrid gene containing sequence encoding the first 
41 amino acids of GPA1 (the yeast Got subunit) fused to sequence encoding human God2a 
(minus codons for the N-terminal 33 amino acids) replaces GP A 1 at its normal chromosomal 
location. The yeast STE14 gene is disrupted to lower the basal level of signaling through the 
pheromone response pathway. The yeast a-factor receptor gene, STE3, is deleted. The last 
two modifications are probably not essential, but appear to improve the signal-to-noise ratio. 

CY455 (MATa tbtl-1 ura3 leu2 trpl his3 fusl-HIS3 canl stel4::TRPl ste3*1156) 
was transformed with the following plasmids: 

Cadus 1289 + Cadus 1215 = Receptor Ligand"= (R-L-) 
Cadus 1 303 + Cadus 1215 = Receptor" 1 " Ligand- = R+L- 
Cadus 1289 + Cadus 1 297 = Receptor Ligand + = (R-L+) 
Cadus 1303 + Cadus 1297 - Receptor* Ligand +«= (R+L+) 
Receptor refers to the human C5a receptor. 
Ligand refers to human C5a*_ - ■ - _ ■ 

Three coloni s were picked from each transformation and grown overnight in media 

lacking leucine and uracil, at pH 6.8 with 25 mM PIPES (LEU URA pH6.8 with 25 mM 
PIPES). This media was made by adding 0.45 ml of sterile 1M KOH and 2.5 ml of sterile 
1M PIPES pH 6.8 to 100 ml of standard SD LEU- URA- media. After overnight growth the 
pH of this media is usually acidified to approximately pH 5.5. Overnight cultures were 



washed once with 25 mM PIPES pH 6.8 and resuspended in an equal volume of media 
lacking leucine, uracil and histidine (LEU URA HIS pH 6.8 with 25 mM PIPES). The 
optical density at 600 nm of a 1/20 dilution of these cultures was determined and the cultures 
were diluted into 25 mM PIPES pH 6.8 to a final ODgQO of °- 2 - A volume (5ul) of this 
dilution equivalent to 10,000 cells was spotted onto selective (HIS+ TRP- pH6.8) plates. 
Only those strains expressing both C5a and its receptor (R+L+) show growth on the selective 
plates which lack histidine. All test strains are capable of growth on plates containing 
histidine. The R+L+ strain will grow on plates containing up to 5 mM aminotriazole, the 
highest concentration tested. 

For verification of pheromone pathway activation and quantification of the 
stimulation, the activity of the fusl promoter was determined colorometrically using a fusl- 
lacZ fusion in a similar set of strains. CY878 (MATct tbtl-1 fusl-HIS3 caNl 
stel4::trpl::LYS2 ste3*1156 gpal(41)-Gai2) was used as the starting strain for these 
experiments. This strain is a trpl derivative of CY455. The transformants for this 
experiment contained CADUS 1584 (pRS424-fusl-lacZ) in addition to the receptor and 
ligand plasmids. Four strains were grown overnight in SD LEU URA TRP pH6.8 with 
50mM PIPES to an OD 600 of less than 0.8. Assay of p-galactosidase activity (Guarente 
1983) in these strains yields the data shown in Figure 4. The coupling of the C5a receptor to 
Ga chimeras is shown in Table 2. 

■K 

Uses of the Autocrine C5a Strains: 

A primary use of the autocrine C5a strains will be in the discovery of C5a 
antagonists. Inhibitors of the biological function of G5a would be expected to protect against 
tissue damage resulting from inflammation in a wide variety of inflammatory disease 
processes including but not limited to: respiratory distress syndrome (Duchateau et al. (1984) . 
Am Rev Respir Dis 130:1058); (Hammerschmidt et al. (1980) Lancet 1:947), septic lung 
injury (Olson et al. 1985) Ann Surg 202:771), arthritis (Baneijee etal. (1989) J. Immuinol 
142:2237), ischemic and post-ischemic myocardial injury (Weisman (1990) Science 
146:249); (Crawford et al. (1988) Circulation 78:1449) and burn injury (Gelfand et al. (1982) 
J. Clin Invest 70:1 170). 

The autocrine C5a system as described can be used to isolate C5a antagonists as 
follows: 

1. High throughput screens to identify agonists of the C5a receptor. 

Figure 5 illustrates an exemplary set of steps for isolating surrogate ligands for the 
C5a receptor by the subject autocrine SSCL™ method. As described above, yeast cells were 
engineered to express human C5a receptor under conditions whereby the receptor is 



functionally coupled to a fusl:his3 reporter gene construct. The cells are transformed with a 
library encoding random peptides (supra) and plated on selective (His-) media. In the first 
round of screening (see Figure 5) yeast colonies are isolated by their ability to grow on the 
histidine deficient plates. In order to distinguish growth due to real receptor agonists, as 
opposed to revertants of the histidine auxotroph, DNA was extracted from the colonies 
isolated in the first round, amplified in E co/j, and transformed back into the engineered yeast 
cells and plated on His- plates. High frequency of transformation or "jackpots" of cell 
growth in the the second round indicates plasmids encoding genuine receptor agonists; 
individual colonies were picked, plasmid DNA isolated, amplified in E. coli, and the 
sequence of the surrogate ligand deduced from the DNA sequence corresponding peptide- 
encoding region of each isolated plasmid. 

After sequencing and deducing the amino acid sequence of the encoded surrogate 
ligands identified in the histidine auxotrophy rescue assay described above, individual 
peptides were chemically synthesized, dissolved in DMF, and spotted on a lawn of the 
engineered yeast cells. As illustrated in Figure 6, C5a receptor agonists result in growth of 
cells around the areas were a peptide was spotted. Figure 7 shows the amino acid sequence 
for C5a surrogate agonist peptides obtained by the above method. Interestingly, the isolates 
do not show extensive sequence homology to one and other, though several duplicate isolates 
were found amongst different transformants. 

Using the fus 1 :lacZ reporter gene construct described above, yeast cells engineered to 
express the human C5a receptor were stimulated with synthetic C5a surrogate peptides at 
varying concentrations. Figure 8 shows the dose response curve for various of the surrogate 
peptide ligands using on a colorimetric lacZ readout. 

The activity of the surrogate ligands were subsequently tested, as shown in Figure 9, 
by contacting the mammalian cell-line HEK293, which has been further engineered to 
provide a C5a receptor coupled with a CRE:lacZ reporter gene construct, with chemically 
synthesized versions of the peptides identified as C5a ligands by the autocrine method 
described above. _ 

To further improve^he selectivity and/or potency of the agonists identified by the 
above steps, we selected a surrogate peptide (peptide 122, see Figure 7), and created 
degenerate peptide libraries based on the sequence of that peptide as a starting point (e.g., a 
semi-random library) as follows: 

CSa pepl22 Tyr-Thr-Arg-Gly-Trp-Lye-Ala-Arg-Leu-I.eu-Trp-Leu-Ile ' 
sub-libraries ^ . 

N-term Xaa-Xaa-Xaa-Xaa-Trp-Lys-Ala-Arg-Leu-Leu-Trp-Leu- Il^^Sfea ibMl^ 
mid4 Tyr-Thr-Arg-Gly-Xaa-Xaa-Xaa-Xaa-Leu-Leu-Trp-Leu-Ile(^t3l»>*«t?c) 
C-term Tyr-Thr-Arg-Gly-Trp-Lys-Ala-Arg-Xaa-Xaa-Xaa-Xaa-Xaa I***'* TV 
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$ modi Xaa - Thr - Arg - Xaa - Trp - Lys -Xaa -Arg - Leu -Xaa - Trp - Leu - Xaa ||)a<) * 

mod2 Tyr-Xaa-Arg-Gly-Xaa-Lys-Ala-Xaa-Leu-Leu-Xaa-Leu-Ile^^/J)^! j 
mod3 Tyr-Thr-Xaa-Gly-Trp-Xaa-;aa-Arg^ 

Following the protocols set out above for the first generation peptide library, the second 
generation peptide library was screened, and individual clones isolated based on their ability 
to stimulate C5a receptor dependent transcription. 

The amino acid sequence for an isolate from the second generation peptide library, 
designatedhe^dn^^modl-5, was deduced to be Asp-Thr-Arg-Ser-Trp-Lys-Leu-Arg-Leu- 
Leu-Tip-Leu-Ala. As illustrated by Figure 10, the 122modl-5 peptide was chemically 
synthesized and contacted with a yeast cell engineered with a human C5a receptor and 
fusl:lacZ reporter gene construct. The activity of the peptide (122modl-5), determined by 
its ability to stimulate expression of the lacZ gene, was compared with the original peptide 
(122) used to generate the second generation peptide library and two other C5a receptor 
agonists. See Koneatis et al. (1994) J Immunol 153:4200. 

2. Identification of antagonists of C5a. 
In another embodiment, replacement of the fusl-HIS3 read-out with one of several 

negative selection schemes (fusl-URA3/FOA, fusl-GALl /galactose or deoxygalactose, Farl 
sst2 or other mutations that render yeast supersensitive for growth arrest) would generate a 
test system in which the presence of an antagonist would result in the growth of the assay 
strain. Such an approach would be applicable to high-throughput screening of compounds as 
well as to the selection of antagonists from random peptide libraries expressed in autocrine 
yeast. Optimization of screens of this type would involve screening the R+L+ strain at a 
concentration of aminotriazole which ablates growth of the R+L- strain (we are currently 
using 0.6 to 0.8 mM) and counterscreening the R+L- strain at a concentration of 
aminotriazole which gives an identical growth rate (we are using 0.14 mM), In addition, the 
system could employ one of several colorometric, fluorescent or chemiluminescent readouts. 
Some of the genes which can be fused to the fusl promoter for these alternate read-outs 
- include lacZ (colorometric -and-fluorescent substrates), glucuronidase 20 (colorometric and ~ 
fluorescent substrates), phosphatases (e.g. PH03, PH05, alkaline phosphatase; colorometric 
and chemiuminescent substrates), green protein (endogenous fluorescence), horse radish 
peroxidase (colorometric), luciferase (chemiluminescence). 

The autocrine C5a strains have further utility as follows: 

3. In the identification of novel C5a agonists from random peptide libraries expressed in 
autocrine veast. 

Novel peptide agonists would contribute to structure/function analyses used to guide 



the rational design of C5a antagonists. 



4. In the identification of receptor mutants. 

Constitutively active, that is, ligand independent, receptors may be selected from 
highly mutagenized populations by growth on selective media. These constitutively active 
receptors may have utility in permitting the mapping of the sites of interaction between the 
receptor and the G-protein. Identification of those sites may be important to the rational 
design of drugs to block that interaction. In addition, receptors could be selected for an 
ability to be stimulated by some agonists but not others or to be resistant to antagonist. 
These variant receptors would aid in mapping sites of interaction between receptor and 
agonist or antagonist and would therefore contribute to rational drug design efforts. 

5. In the identification of molecules that interact with Gai2. 

Compounds or peptides which directly inhibit GDP exchange from Gcci2 would have 
the same effect as C5a antagonists in these assays. Additional information would distinguish 
inhibitors of GDP exchange from C5a antagonists. This information could be obtained 
through assays that determine the following: 

1 . inhibition by test compounds of Gai2 activation from other receptors, 

2. failure of test compounds to compete with radiolabeled C5a for binding to the C5a 
receptor, 

3. failure of test compounds to inhibit the activation of other Got subunits by C5a, 

and 

4. inhibition by test compounds of signalling from constitutively , active versions of 
C5a, or other, receptors. 

Example 7: Construction of Xybrid Got Genes Construction of two sets of chimeric 
yeast/mammalian Get genes, GPA 41 -Ga and GPAI Bam -Ga. 

The Ga subunit of heterotrimeric G proteins must interact with both the Py complex and the 
receptor. Since the domains of Ga required for each of these interactions have not been 
completely defined and since our final goal requires Ga proteins that communicate with a 
mammalian receptor on one hand and the yeast Py subunits on the other, we desired to derive 
human-yeast chim ric Ga proteins with an optimized ability to perform both functions. 
From the studies reported here we determined that inclusion of only a small portion of the 
amino terminus of yeast Ga is required to couple a mammalian Ga protein to the yeast py 
subunits. It was anticipated that a further benefit to using these limited chimeras was the 
preservation of the entire mammalian domain of the Ga protein believed to be involved in 
receptor contact and interaction. Thus the likelihood that these chimeras would retain their 
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was expected to be quite high. 

Plasmid constructions. 

pRS416-(7iMJ (Cadus 1069). An Xbal - Sad fragment encoding the entire GPA1 
promoter region, coding region and approximately 250 nucleotides of 3* untranslated region 
was excised from 10 YCplacll 1-GPA1 (from S. Reed, Scripps Institute) and cloned into 
YEp vector pRS416 (Sikorski and Hieter, Genetics 122: 19 (1989)) cut with Xbal and Sad. 

Site-directed mutagenesis of GPA1 (Cadus 1075, 1121 and 1122). A 1.9 kb EcoRI 
fragment containing the entire GPA1 coding region and 200 nucleotides from the 5' 
untranslated region was cloned into EcoRI cut, phosphatase-treated p ALTER- 1 (Promega) 
and transformed by electroporation (Biorad Gene Rulser) into DH5aF' bacteria to yield 
Cadus 1075. Recombinant phagemids were rescued with M13K07 helper phage and single 
stranded recombinant DNA was extracted and purified according to the manufacturer's 
specifications. A new Ncol site was introduced at the initiator methionine of GPA1 by 
oligonucleotide directed mutagenesis using the synthetic oligonucleotide: 

5* GATATATTAAGGTAGGAAA CCATGG GGTGTACAGTGAG 3' 

A, 1 

Positive clones were selected in ampicillin and several independent clones were sequenced in 
both directions across the new Ncol site at +1. Two clones containing the correct sequences 
were retained as Cadus 1121 and 1 122. 

Construction of a G/Mi-based expression vector (Cadus 1127). The vector used 
for expression of full length and hybrid mammalian Ga proteins in yeast, Cadus 1 127, was 
constructed in the following manner. A 350 nucleotide- fragment spanning the 3' untranslated 
region of GPA1 was amplified with Taq polymerase (AmpliTaq; Perkin Elmer^gjn^tj^,^ 
oligonucleotide primers A (5 1 CGAGGCTCGAGGGAACG^^^^AAAGTAGTG 3')^ ^ 
and B (5* GCGCGGTACCAAGCTTCAATTCGAGATAATACCC 3'^° The 350 nucleotide 
product was purified by gel electrophoresis using GeneClean II (BiolOl) and was cloned 
directly into the pCRII vector by single nucleotide overlap TA cloning (InVitrogen). 
Recombinant clones were characterized by restriction enzyme mapping and by 
dideoxynucleotide sequencing. Recombinant clones contained a novel Xhol site 5' to the 
authentic GPA1 sequence and a novel Kpnl site 3' to the authentic GPA1 sequence donated 
respectively by primer A and primer B/ 

The NotI and SacI sites in the poly linker of Cadus 1013 (pRS414) were removed by 
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restriction with these enzymes followed by filling in with the Klenow fragment of DNA 
polymerase I and blunt end ligation to yield Cadus 1092. The 1.4 kb PstI - EcoRI 5* 
fragment of GPA1 from YCplaclll- GPA1 containing the GPA1. promoter and 5' 
untranslated region of GPA1 was purified by gel electrophoresis using GeneClean (BiolOl) 
and cloned into PstI - EcoRI restricted Cadus 1013 to yield Cadus 1087. The PCR amplified 
Xhol - Kpnl fragment encoding the 3' untranslated region of GPAJ was excised from Cadus 
1089 and cloned into Xhol - Kpnl restricted Cadus 1087 to yield Cadus 1092. The Notl 
and Sacl sites in the polylinker of Cadus 1092 were removed by restriction with these 
enzymes, filling in with the Klenow fragment of DNA polymerase I, and blunt end ligation to 
yield Cadus 1110. The region of Cadus 1 122 encoding the region of GPA1 from the EcoRI 
site at -200 to +120 was amplified with Vent DNA polymerase (New England Biolabs, 
Beverly, MA) with the primers 

5' CCCGAATCCACCAATTTCTTTACG 3 JP<^, 0^.-34) 
5' GCGGCGTCGACGCGGCCGCGTAACAGT 3\ 

A 

The amplified product, bearing an EcoRI site at its 5' end and novel Sacl, Notl and 
Sail sites at its 3' end was restricted with EcoRI and Sail, gel purified using GeneClean II 
(BiolOl), and cloned into EcoRI and Sail restricted Cadus 1110 to yield Cadus 1127. The 
DNA sequence of the vector between the EcoRI site at -200 and the Kpnl site at the 3' end of 
the 3' untranslated region was verified by restriction enzyme mapping and dideoxynucleotide 
DNA sequence analysis. 

PCR amplification of GPA^-Ga proteins and cloning into Cadus 1127. cDNA 
clones encoding the human G alpha subunits Gas, Gai2, Gai3, and S. cerevisiae GPA1 were 
amplified with Vent thermostable polymerase (New England Bioloabs, Beverly, MA). The 
primer pairs used in the amplification are as follows: &ftx.ll>A-6'f?) 
GctS Primer 1 : 5'CTGCTGGAGCTCCGCCTGCTGCTGCTGGGTGCTGGAG3' (Sacl 5') K 

Primer 2 : S'CTGCTGGTCGACGCGGCCGCGGGGGTTCCTTCTTAGAAGCAGCS' 

(SalI3') A 6^-^^ o: ^ CSL to**'? ) 

Primer 3 : 5'GGGCTCGAGCCTTCTTAGAGCAGCTCGTAC3' (XholT? ' 

\ 

Goti2 Primer 1 : S'CTGCTGGAGCTCAAGTTGCTGCTGTTGGGTGCTGGGGS' (SacIS')^ ' 

Primer 2 : 5'CTGCTGGTCGACGCGGCCGCGCCCCTCAGAAGAGGCCGCGGT 
CC3' (Sail 3') A C*^Vt> 

Primer 3 : 5'GGGCTCGAGCCT CAGAAGAGGCCGCAGTC3' (Xhol 3'), 

Gai3 Primer 1: 5'CTGCTGGAGCTCAAGCTGCTGCTACTCGGTGCTGGAG3' (SacIS 1 )^ 
Primer 2 : 5'CTGCTGGTCGACGCGGCCGCCACTAACATCCATGCTTCTCAAT 
AAAGTC3' (Sail 3') A £ 56 '' a ^^ ^Hj , . 
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After amplification, products were purified by gel electrophoresis using GeneClean II 
(Biol 01) and were cleaved with the appropriate restriction enzymes for cloning into Cadus 



The hybrid GPA 4r G a subunits were cloned via a Sad site introduced at the desired 
position near the 5 ! end of the amplified genes and a Sail or Xhol site introduced in the 3 ! 
untranslated region. Ligation mixtures were electroporated into competent bacteria and 
plasmid DNA was prepared from 50 cultures of ampicillin resistant bacteria. 

Construction of Integrating Vectors Encoding GPA 41 jG a Subunits. The coding region of 
each GPA^-Ga hybrid was cloned into an integrating vector (pRS406 = URA3 AmpR) using 
the BssHII sites flanking the polylinker cloning sites in this plasmid. Cadus 1011 (pRS406) 
was restricted with BssHII, treated with shrimp alkaline phosphatase as per the 
manufacturer's specifications, and the linearized vector was purified by gel electrophoresis. 
Inserts from each of the GPA 41 -G a hybrids were excised with BssHII from the parental 

plasmid, and subcloned into gel purified Cadus 1011. 

Construction of GPA BAM -Ga Constructs. A novel BamHI site was introduced in 
.frame into the GPA1 coding region by PCR amplification using Cadus 1179 (encoding a 
wildtype GPA1 allele with a novel Ncol site at the initiator methionine) as the template, 
VENT . polymerase, and the following primers: Primer A = 5' 



GCATCCATCAAIAATCCAG^j 1 ' ana Primer B = 5' GAAACAATGGA - 



TCCACTTCTTAC 3'. The 1.1 kb PCR product was gel purified with GeneClean II 
(Biol 01), restricted with Ncol and BamHI and cloned into NcoI-BamHI cut and 
phosphatased Cadus 1122 to yield Cadus 1605. The sequence of Cadus 1605 was verified by 
restriction analysis and dideoxy-sequencing of double-stranded templates. Recombinant 
GPA Bam -Ga hybrids of Gas, Gai2, and Gal 6 were generated. Construction of Cadus 1855 
encoding recombinant GPA Bam -Ga 16 serves as a master example: construction of the other 
hybrids followed an analogous cloning strategy. The parental plasmid Cadus 1617, encoding 
*iative Gal 6, was restricted with Ncol and BamHI, treated with shrimp alkaline phosphatase 
as per the manufacturer's specifications and the linearized vector was purified by gel 
electrophoresis: Cadus 1605 was restricted with Ncol and BamHI and the 1.1 kb fragment 
encoding the amino terminal 60% of GPA1 with a novel BamHI site at the 3' end was cloned 
into the Ncol- and BamHI-restricted Cadus 1617. The resulting plasmid encoding the 
GPA Bam -Ga 16 hybrid was verified by restriction analysis and assayed in tester strains ror an 
ability to couple to yeast G(5y and thereby suppress the gpaJ null phenotype. Two additional 
GPA Bam -Ga hybrids, GPA Bam -Gas and GPA Bam -Gai2, described in this application were 
prepared in an analogous manner using Cadusl606 as. the parental plasmid for the 



1127. 
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construction of the GPA Bam -Ga i2 hybrid and Cadus 1181 as the parental plasmid for the 
construction of the GPA Bam -Ga s hybrid. 

Coupling by chimeric Ga proteins. The Ga chimeras described above were tested for the 
ability to couple a mammalian G protein-coupled receptor to the pheromone response 
pathway in yeast. The results of these experiments are outlined in Table 3. Results obtained 
using GPA1 41 -Gai2.tp couple the human C5a receptor to the pheromone response pathway in 
autocrine strains of yeast are disclosed in above. 

Example 8: Screening for Modulators of G-alpha activity 

Screens for modulators of Ga activity may also be performed as shown in the 
following examples for illustration purposes, which are intended to be non-limiting. 

Strains CY4874 and CY4877 are isogenic but for the presence of Q205L mutation in 
the cloned Ga^ gene cloned into plasmid 1. Strains CY4901 and CY4904 each have a 
chromosomally integrated chimeric Ga fusion comprising 41 amino acids of gpal at the N 
terminus of the human Ga i2 gene and are isogenic but for the presence of a constitutively 
activating mutation in the C5a receptor gene of CY4901. Strain CY5058 is a gpal mutant 
which carries only the yeast Gpy subunits and no Ga subunit. This strain is a control strain 
to demonstrate specificity of action on the Ga subunit. 



I. Suppression of Activation by Mutation of Ga 

The Q205L mutation is a constitutively activated GTPase deficient mutant of the humafla a 
gene. Antagonist compounds, chemicals or other substances which act on Ga a can be 
recognized by their action to reduce the level of activation and thus reduce the signal from 
the fusl-lacZ reporter gene on the second plasmid (Plasmid 2). 

A. GTPase Ga i2 Mutants 

test component = gna^-Ga^ (Q 2 os L ) 
control component = gpa 41 -Ga a 

As well as the CY4874 and CY4877 constructs detailed above, similar strains with fusl-His3 
or fus2-CAN-l growth readouts may also be used. The fusl-His3 strains are preferred for 
screening for agonists and the fus2-CANl strains are preferred for antagonist screens. 



Readout 
fusl-fflS3 



fusl-lacZ 
fus2-CANl 



test 

strain 

CY4868 



CY4874 
CY4892 



effect of Ga i7 antagonist 

inhibit growth of -HIS 
+AT (Aminotriazole) 

reduce p-gal activity 

induce growth on 
canavanine , 



. control 
strain 
CY4871 



CY4877 
CY4386 



In each case an antagonist should cause the test strain to 
behave more like the control strain. 



Readout 

fusl-HIS3 
fusl-lacZ 
fus2-CANl 



B. GTPase Goc s Mutants (Ga Specificity) 

_ _ test component = Ga s (Q22 7 L) 
control component = Ga s 
test effect of Ga L i antagonist 

strain 

CY4880 none 
CY4886 none 



control 
strain 

CY4883 

CY4889 

CY4898 



CY4895 none 
In each case a non-specific antagonist would cause the test strain to behave more like the 

control strain. 
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Additional media requirements: -TRP for Ga plasmid maintenance in fusl-HIS3 arid 
fus2-CANl screens and -TRP -URA for Ga and fusl-lacZ plasmid maintenance in fusl-lacZ 
screen. 

II. Suppression of Activation by Receptors 

Constitutively Activated C5a Receptors 

test component = C5aR* (? m L, activated C5a Receptor) 
control component = C5aR 

The C5aR* mutation has a Leucine residue in place of the Proline residue of the wild-type at 
position 184 of the amino acid sequence. 

effect of Ga i2 antagonist control 

strain 

inhibit growth of -HIS CY2246 
+AT (Aminotriazole) 

fusl-lacZ GY4901 reduce p-gal activity , CY4904 * 

fus2-CANl CY4365 induce growth on CY4362 * 

canavanine 

In each case an antagonist should cause the test strain to . 
behave more like the control strain. 

Additional , media requirements: -LEU for receptor plasmid maintenance in fusl-HIS3 and 
fus2-CANl screens and -LEU -URA for receptor and fusl-lacZ plasmid maintenance in 
fiisl-lacZ screen, non-buffered yeast media (pH 5.5). 

Example 9: Identification of a surrogate ligand using expression of a random peptide 
library in yeast expressing an orphan mammalian receptor 

FPRL-1 (formyl peptide receptor-like 1) is a structural homolog of the formyl peptide 
receptor (FPR). FPR is a G protein-coupled receptor, expressed on neutrophils and 
phagocytic cells, that is stimulated by N-formyl peptides of bacterial origin. Specific binding 
of the natural ligand, f-Met-Leu-Phe, stimulates transduction of a signal to mobilize calcium, 



Readout 
fusl-HIS3 



test 

strain 

CY4029 
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resulting in cellular changes including chemotaxis and the release of granule contents. Low 
stringency .hybridization of HL60 cDNA libraries with an FPR cDNA probe permitted the 
identification of the related receptor, FPRL-1 (Murphy et al. supra; Ye et al. supra). The 
FPRL-1 cDNA encodes a 351 amino acid protein with 69% sequence homology to FPR 
(Murphy et al. supra) FPR and FPRL-1 were found to co-localize to human chromosome 19 
and to have a tissue expression pattern identical to that of FPR, i.e., expression is restricted to 
cells of myeloid origin (Murphy et al. supra). Ye et al. {supra) demonstrated weak binding 
of f-Met-Leu-Phe (uM concentrations) to fibroblasts transfected with FPRL-1 cDNA. In 
contrast, Murphy et al. {supra) could not detect binding of N-formyl peptides to Xenopus 
oocytes transfected with FPRL-1 cDNA. FPRL-1 appears to be an orphan receptor whose 
specific ligand differs from the formyl peptide ligands to which FPR responds. 

In this example experiments detailing the following will be described: (1) 
establishment of a strain of yeast designed to express the human orphan G protein-coupled 
- receptor FPRL-1; (2) expression of a random peptide library in the aforementioned strain of 
yeast; and (3) activation of the endogenous yeast pheromone pathway upon stimulation of the 
FPRL-1 receptor by a peptide encoded by a random library expressed within the same strain 
of yeast. 

Preparation of FPRL-1 Yeast Expression Vector 

A plasmid, pFPRLl-L31, containing a 2.6 kb EcoRI-Xhol fragment encoding the FPRL-1 
cDNA in the BluescriptIISK+ vector was obtained from Philip Murphy (NIH). The sequence 
encoding FPRL1 was amplified by the polymerase chain reaction using VENT polymerase 
(New England Biolabs, Inc., Beverly, MA) through 20 cycles and the following 
oligonucleotide primers: 

#1 S'GGCGCCCGGTCICCCATGGAAACCAACTTCTCCACT^C^Ljb/OJ?^ 
#2 5'GGCGCCCGGI^CGATCCCAT^GCCTGTAACTCAGTCTC A (Sc^Ll^/<? , ^) 

The PCR product was purified, restricted with Bsal and cloned into Cadus 1651 (plPBX-1), a 
PGK promoter-driven expression vector, using Ncol and BamHI sites, to yield CADUS 
2311. The sequence of the entire insert was determined and found to be identical to the 
FPRL-1 sequence deposited in GenBank (accession number M84562). 

Preparation of Random Oligonucleotides 

Library-Recycling Protocol to Identify a Surrogate Ligand 

The yeast strain CY1141 (MATalpha farl*1441 tbtl-1 fusl-HIS3 can 1 
stel4::trpl:;LYS2 ste3*1156 gpal(41)-Galphai2 lys2 ura3 leu2 trpl his3) was used in the 
experiments that follow. CY1141 contains a pheromone inducible HIS 3 gene, fusl-HIS3 



, integrated at the FUS1 locus and a hybrid gene encoding the first 41 amino acids of GPA1 
(yeast G alpha) fused to sequence encoding human G alphai2 (lacking codons encoding the 
N-terminal 33 amino acids) replacing GPA1 at its chromosomal locus. The yeast STEM 
gene is disrupted to lower the basal level of signaling through the pheromone response 
pathway. The yeast a-factor receptor gene, STE3, is deleted. CY1 141 was transformed with 
Cadus 231 1 to yield CY6571, a strain. expressing the human orphan receptor, FPRL-1. 

CY6571 exhibited LIRMA (ligand independent receptor mediated activation), that is, 
activation of the yeast pheromone pathway in the absence of ligand. It was determined that 
the yeast growth on selective media that resulted from LIRMA was eliminated by the 
additional of 2.5millimolar concentrations of 3-aminotriazole (AT). AT is an inhibitor of the 
H1S3 gene product that serves to reduce background growth. Therefore, selection protocols 
aimed at the identification of surrogate ligands for the FPRL-1 receptor were carried out at 
this concentration of AT. 

CY6571 was inoculated to 10 mis of standard synthetic media (SD) lacking leucine (- 
Leu) and incubated overnight at 30°C. The 10 ml overnight culture was used to inoculate 50 
mis of YEPD; this culture was incubated at 30°C for 4.5-5 hours at which time the cells were 
harvested and prepared for transformation with DNA encoding a random peptide library 
[alpha-NNK (6.24.94)] encoding tridecapeptides of random sequence, by electroporation. 
Post electroporation (in 0.2 cm cuvettes, 0.25 \xF, 200ft, 1.5 kV) the cells were immediately 
diluted in 1 ml ice-cold 1M sorbitol and 100[iL aliquots were placed onto 10 synthetic media 
plates (pH6.8) lacking leucine and uracil (-Leu-Ura). The plates were incubated at 30°C for 
2-4 days at which time two replicas of each original transformation plate were made to 
. synthetic media (pH6.8) lacking leucine, uracil and histidine and supplemented with 2.5mM 
AT(-Leu-Ura-His+2.5mM AT). The replicas were incubated at 30°C for 3-5 days. Post 
incubation the colonies present on the replica sets of two were scraped from the plates into a 
total of 10 mis of H2O (5 mis each plate). The ODgQO °f eac h ce ^ suspension was 
determined and crude plasmid isolations were done on 8-16 OD units of cells for ,each pool. 
A total of eight pools resulted, due to lower numbers of yeast colonies present in four sets of 
plates. The pellets obtained from these crude plasmid isolations (the so called "smash and 
grab" technique, Methods in_ Yeast Genetics - A Laboratory Manual, 1990, M.D. Rose, F. 
Winston and P. Heiler. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.), 
were resuspended in 40|iL of 10 mM Tris, 1 mM EDTA, pH8.0 and l|iL was used to 
transform E. coli by electroporation (0.1. cm cuvettes, 0.25 \x¥, 200Q, 1.8 kV). Post 
electroporation the cells were immediately diluted into 1 ml 2XYT media and incubated, 
with shaking, at 37°C for 30 minutes after which time the cells were used to inoculate 50 mis 
of 2xYT supplemented with 100 ug/ml ampicillin. The 10 resulting cultures were incubated 
at 37°C overnight. Plasmid DNA was isolated from each of these bacteria cultures using 
Qiagen columns (Qiagen, Inc., Chatsworth, CA)). Each plasmid DNA pellet was 
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resuspended in 50^iL Tris lOmM, EDTA 1 mM, pH 8.0. 

Strain CY6571 was transformed with IjiL of each plasmid pool by electroporation. 
Post electroporation the cells were diluted into 400^L 1M sorbitol. From each electroporated 
cell suspension, l|iL and 400jiL of cells were plated on -Leu-Ura synthetic media, pH6.8 to 
yield M low density" and "high density" platings. The plates were incubated at 30°C for 3. 
days, at which time replicas of both the low and high density plates were made to -Leu-Ura- 
His+2.5mM AT. For those cases where enrichment for a plasmid capable of conferring a 
His+ phenotype had occurred, this would be reflected by an amplified number of His+ 
colonies on both the low and high density plates visible at days 2-3, although the 
amplification would be most obvious on the plates that had received a high density of cells. 
In the FPRL-1 experiment 1/8 pools showed amplification of His+ colonies. The cells were 
scraped from this plate into 5 mis of H2O, the ODgoo °f * e ce ^ suspension was determined 
and a crude plasmid isolation was done on 15 OD units of yeast cells. The pellet obtained 
was resuspended in 40|iL lOmM Tris, 1 mM EDTA, pH8.0 and ljaL was used to transform 
E. colt. Plasmid DNA was isolated by miniprep from 3 ml 2XYT cultures of single bacterial 
colonies resulting from this transformation. 10 DNA pellets (Al through A10) deriving from 
individual bacterial colonies were resuspended in 20^L 10 mM Tris 1 mM EDTA, pH8.0 and 
used to transform CY6571 (containing the FPRL-1 expression vector) and CY6263 (CY1 141 
containing a control expression vector lacking any receptor sequence) by electroporation. 
Cadus 1625, a control vector lacking sequences encoding a peptide, was included and used to 
transform both the receptor* and receptor- strains of yeast. Transformants were first selected 
on -Leu-Ura, pH6.8 then three yeast transformants of each type (from 11 CY6571 
transformations and 1 1 CY6263 transformations) were patched to -Leu-Ura, pH6.8 to expand 
the colonies. Once expanded, streaks of the transformants were made on -Leu-Ura- 
His+2.5mM AT to test for growth in the absence of histidine. All plasmids except the one 
denoted A2 conferred a growth advantage on media lacking histidine to yeast bearing the 
FPRL-1 -encoding plasmid but not to yeast lacking the receptor plasmid. The peptide 
sequence found to be enco^d plasmids Al and , A3-A10 is: 
SerLeuLeuTrpLeuThrCysArgProTrpGuKlaMet^ and is encoded by the nucleotide sequence 
5'-TCT CTG CTT TGG CTG ACT TGT CGG CCT TGG GAG GCG ATG-3'^* ,a 7 

A- 

Activation of the Pheromone Response Pathway in Yeast Expressing the FPRL-1 Receptor 
and Peptide Agonist. 

For verificatiin of pheromone pathway activation and quantification of the 
stimulation, the activity of the fusl promoter was determined colorimetrically using a fusl- 
lacZ fusion in a parallel set of test strains. CY1141, described above, was used as the 
recipient strain for these experiments. Transformants contained CADUS 1584 (pRS424- 
fusl-lacZ) in addition to receptor (R +/ ~) and ligand (L +/ -) plasmids. Four strains (bearing the 
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identical plasmids) were grown overnight in minimal media lacking leucine, uracil, and 
tryptophan, pH8.6. The overnight cultures were used to inoculate -Leu -Ura -Trp pH6.8 
media and these new cultures were grown for approximately 4.5-5 hours to an OD 600 of less 
than 0.4. Assay of P-galactosidase activity (Guarente 1983) in cells from these cultures 
yielded the following results: 

CY1 141/CADUS 231 1/peptide Al/CADUS 1584 R+I+ 28 units 

CY1 141/CADUS 231 1/CADUS 1625/CADUS 1584 . R+L- 3 units 

CY1 141/CADUS 1289/peptide Al/CADUS 1584 R-L+ 3.5 units 

CY1 141/CADUS 1289/CADUS 1625/CADUS 1584 R-L" 3.9 units 

The presence of receptor and peptide-encoding plasmids resulted in an average 8-fold 
stimulation over background levels of P-galactosidase. . 

Autocrine activation of the pheromnne r e sponse nathwav in veast expressing hv FPRT -1 
agonists or C5a receptor agonists. 

The results illustrated in Figure 11 were obtained using yeast cells engineered to 
express FPRL-1 or the C5a receptor under conditions wherein the signal transduction from 
the heterologous receptor was coupled to a fusl :lacZ reporter gene construct described above. 
Figure 11 demonstrates the specificity of the surrogate ligand A5 for FPRL-1, and the 
surrogate ligand F6, as well as that of the native C5a ligand, for the C5a receptor! In each 
instance, the presence of both the receptor and surrogate peptide result in an 8-12 fold 
increase in lacZ expression over the level observed in the absence of either the receptor, 
ligand, or both. 

Activation of Human Ne utrophils hv a surrogate FPRL agonist. 

Human neutrophils in culture were stimulated with varying concentrations of the 
FPRL surrogate ligand A5, and intracellular Ca** mobilization was detected by Fluorescence 
Activated Cell Sorter (FACS) analysis based on FURA2 dye absorbance ratios. The response 
of the human neutrophils to the C5a peptide was also-measured. As shown in Figure 12, the 
A5 peptide produced a dose-dependent increase in intracellular calcium mobilization, 
indicating that it is capable of activating endogenous FPRL-mediated pathways in human" 
neutrophil 

Preparation of second generation FPRL ligand libraries 

To further improve the selectivity and/or potency of the agonists identified by the 
above steps, we selected a surrogate peptide (A5), and created degenerate peptide libraries 
based on the sequence of that peptide as a starting point (e.g., a semi-random library) as 
follows: 



FPRL-1 peptide A5 Ser-Leu-Leu-Trp-Leu-Thr-Cys-Arg-Pro-Trp-Glu-Ala-Met 

eub- libraries /> . . r\ 

N-term Xaa-Xaa-Xaa-Xaa-Xaa-Thr-Cys-Arg-Pro-Trp-Glu-Ala-Met 



mid4 Ser-Leu-Leu-Trp-Leu-Xaa-Xaa-Xaa-Xaa-Trp-Glu-Ala J^et^ 
C-term Ser-Leu-Leu-Trp-Leu-Thr-Cys-Arg-Pro-Xaa-Xaa-Xaa-Xaa^ x 
mod2 Xaa - Leu - Xaa - Trp -Xaa - Thr - Xaa -Arg-Xaa - Trp - Xaa - Ala - Xaa . -A 



i-xaa. s\ 



mod3 Ser-Xaa-Leu-Xaa-Leu-Xaa-Cys-Xaa-Pro-Xaa-Glu-Xaa-Me 

Following the protocols set out above for the first generation peptide library, the second 
generation peptide library was screened, and individual clones isolated based on their ability 
to stimulate FPRL receptor dependent transcription. 



Example 10: Identification of surrogate ligands using expression of a random peptide 
library in yeast expressing the orphan mammalian receptor, MDR-1S. 

In a similar manner a plasmid encoding the monocyte derived receptor monocyte- 
derived receptor 15 (MDR15; Barella et al. (1995) Biochem. J. 309:773-9) was used to 
construct a yeast strain (CY6573) expressing- this receptor. This receptor is an alternative 
spliced form of the Burkitt's lymphoma receptor 1 (BLR1) encoded by a human Burkitt's 
lymphoma cDNA (Dobner et al. (1992) Eur J. Immunol; 22, 2795-2799). Strain CY6573 
was transformed in a similar manner with the NNK13 library, and, following selection on ten 
-Leu-Ura (4.4 x 10 5 colonies per plate), replica plated to -Leu-Ura-His-i- ImM AT plates. 
Upon reisolation of plasmid pools and re-transformation into strain CY6573; eight of ten 
pools showed signicantly enriched colony formation on -Leu-Ura-His+ ImM AT plates. 
Eight unique plasmids derived from these pools when retransformed into CY6573 conferred 
growth on -Leu-Ura-His+ ImM AT plates. One of these plasmids failed to confer growth in a 
yeast strain lacking the MDR1 5 receptor. 

Example 11: Identification of a ligand using expression of a random peptide library in 
yeast expressing the human thrombin receptor 

The receptor for thrombin, a G protein-coupled receptor, is present on numerous cell 
types including platelets, vascular smooth muscle, fibroblasts and on a subset of cells that 
function in immunity. Thrombin, a serine protease, binds to and cleaves the receptor 
molecule at residue 41, generating a new receptor N-terminus. The post-cleavage N-terminal 
residues then act as a "tethered ligand 1 to activate the receptor molecule (Vu et al. 1994). In 
platelets, signaling through the thrombin receptor has been shown to result in numerous 
effects including stimulation of phospholipase C, mobilization of intracellular Ca^"*" and 



-97- 



inhibition of adenylyl cyclase. 

In this example experiments that detail the following will be described (1) 
establishment of a strain of yeast designed to express the human G protein-coupled receptor 
for thrombin; (2) expression of a random peptide library in the afore-mentioned strain of 
yeast and (3) activation of the endogenous yeast pheromone pathway upon stimulation of the 
thrombin receptor by peptides encoded by a random library expressed within the same strain 
of yeast. 

Preparation of a Yeast Expression Vector for a Mammalian Thrombin Receptor 

The human thrombin receptor was amplified by PCR from pcDNA3:Hu-Thr9b-5' 
(Bristol Myers Squibb) using the following oligonucleotides: 

5' GGGCCATGGGGCCGCGGCGGTTG ^Wol) 

5' CCCGGATCCTAAGTTAACAGCTTTTTGTATAT 3^ (S^^**^ 

The amplified product was purified by gel electrophoresis, restricted with Ncol and BamHI 
and ligated to Ncol and BamHI -cut CADUS 1871, a PGK promoter-driven expression vector, 
to yield CADUS 2260. Cloning into CADUS 1871 introduces a novel stop codon preceded 
by the triplet GlySerVal after the authentic carboxy terminal codon of the human thrombin 
receptor (threonine). In addition, an invertase signal sequence is fused to the authentic amino 
terminus of the receptor. 

CY7467 exhibited LIRMA (ligand independent receptor mediated activation), that is, 
activation of the yeast pheromone pathway in the absence of ligand. It was determined that 
the yeast growth on selective media that resulted from LIRMA was eliminated by the 
addition of 2.5 millimolar concentrations of 3-aminotriazole (AT). AT is an inhibitor of the 
HIS3 gene product that serves to reduce background growth. Therefore, selection protocols 
aimed at the identification of novel peptide ligands for the human thrombin receptor were 
carried out at this concentration of AT. 

P reparation of Random Oligonucleotide Library 
As described above. . 

Recycling Protocol to Identify a Surrogate Ligand 

The yeast strain CY1141 (MATalpha farl*1442 tbtl-1 fusl-HIS3 canl 
stel4::trpl::LYS2 ste3*1156 gpal(41)-Galphai2 lys2 ura3 leu2 trpl his3) was transformed 
with CADUS 2260 to yield strain CY7467, expressing the human thrombin receptor. 



CY7467 was inoculated to 10 mis of standard synthetic media (SD) lacking leucine (-Leu) 
and. incubated overnight at 30 C. The 10 ml overnight culture was used to inoculate 50 mis 
of YEPD media; this culture was incubated at 30 C for 4.5-5 hours at which time the cells 
were harvested and prepared for transformation with DNA encoding a random peptide library 
[alpha-NNK (6:24.94)] by electroporation. Post electroporation (in 0.2 cm cuvettes, 0.25 mF, 
200 W, 1.5 kV) the cells were, immediately diluted in 1 ml ice-cold 1M sorbitol and lOOmL 
aliquots were plated onto 10 synthetic media plates (pH6.8) lacking leucine and uracil (-Leu- 
Ura). The plates were incubated at 30 C for 2-4 days at which time two replicas of each 
original transformation plate were made to synthetic media (pH6.8) lacking leucine, uracil 
and histidine and supplemented with 2.5mM AT(-Leu-Ura-His+ 2.5rnM AT). The replicas 
were incubated at 30 C for 3-5 days. Post incubation the colonies present on the replica sets 
of two were scraped from the plates into a total of 10 mis of H20 (5 mis each plate). The 
OD600 °f ea °h ce ^ suspension was determined and crude plasmid isolations were done on 8- 
16 OD units of cells for each pool. A total often pools resulted. The pellets obtained from 
these crude plasmid isolations were resuspended in 40mL of 10 mM Tris, 1 mM EDTA, 
pH8.0 and ImL was used to transform E. coli by electroporation (0.1 cm cuvettes, 0.25 mF, 
200W, 1.8 kV). Post electroporation the cells were immediately diluted into 1 ml 2XYT 
media and incubated, with shaking, at 37 C for 30 minutes after which time the cells Were 
used to inoculate 50 mis of 2xYT supplemented with 100 ug/ml ampicillin. The 10 resulting 
cultures were incubated at 37 C overnight. Plasmid DNA was isolated from each of these 
bacterial cultures using Qiagen columns (Qiagen, Inc., Chatsworth, CA). Each plasmid DNA 
pellet was resuspended in 50mL Tris lOmM, EDTA 1 mM, pH 8.0. 

Strain CY7467 was transformed with ImL of each plasmid pool by electroporation. 
Post electroporation the cells were diluted into 4Q0mL 1M sorbitol. From each 
electroporated cell suspension, ImL and 400mL of cells were plated on -Leu-Ura synthetic 
media, pH6.8 to yield "low density 11 and "high density" platings. The plates were incubated 
at 30 C for 3 days, at which time replicas of both the low and high density plates were made 
to -Leu-Ura-His+ 2.5mM AT. For those cases where enrichment for a plasmid capable of 
conferring a His+ phenotype had occurred, this would be reflected by an amplified number of 
His+ colonies on both the low and high density-plates visible at days 2-3, although the 
amplification would be most obvious on the plates that had received a high density of cells. 
In this experiment 3/10 pools showed amplification of His+ colonies. The cells from each of 
these plates were scraped into 5 mis of H2O, the ODgQO °f ^ e ce ^ suspensions were 
determined and crude plasmid isolations were done on 8-16 OD units of yeast cells. The 
pellets obtained were resuspended in 40mL 10 mM Tris, 1 mM EDTA, pH8.0 and ImL was 
used to transform E. coli. Plasmid DNA was isolated by miniprep from 3 ml 2XYT cultures 
of single bacterial colonies resulting from these transformations (three bacterial colonies for 
each DNA pool were processed in this way). DNAs deriving from three individual bacterial 
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colonies per pool were resuspended in 20mL 10 mM Tris 1 mM EDTA, pH8.0. The three 
DNAs derived per pool were sequenced and found to encode identical peptides. Thus three 
differing DNA sequences were derived, one representing each amplified pool. One plasmid 
representing each of the three original amplified pools was used to transform CY7467 
(containing the thrombin receptor expression vector) and CY6263 (CY1141 containing a 
control expression vector lacking any receptor sequence) by electroporation. CADUS 1625, 
a control vector lacking sequences encoding a peptide was included and used to transform 
both the receptor+ and receptor- strains of yeast. CADUS 1651, a control vector lacking t 
sequences encoding a receptor included and used to transform both the ligand+ and ligand- 
. strains of yeast.. Transformants were first selected on -Leu-Ura, pH6.8, then two yeast 
transformants of each type were patched to -Leu-Ura, pH6.8 to expand the colonies. Once 
expanded, streaks of the transformants were made on -Leu-Ura-His+ 2.5mM AT to test for 
growth in the absence of histidine. One of the three plasmids tested conferred a growth 
advantage on media lacking histidine to yeast bearing the thrombin-encoding plasmid but not 

£ to yeast lacking-the receptor plasmid. The P e P tid ^ £ l ( u 1 e ^ / e ^ oded by ^ plasmid is: Val " 
b Cys-Pro-Ala-Arg-Tyr-Val-Leu-Pro-Gly-Pro-Val^u/and was encoded by the j™ le °ti de . 
£ sequence GTT TGT CCT GCG CGT TAT GTG CTG CCT GGG CCT GTT TTG<^'^^ 
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Table 



1. Detection of C5a production in yeast by ELISA. 
JEzLz r+l- p-t.x 



[C5a] in culture n d HT^ 777 7 ^ 

"•<*• n.d. 0.64 ng/ml 0 .5 ng/ml 

I.CSa] released n d nA "** = 60 

from lysed cells* 9. 8 ng/ml 0.6 ng/ml 

=97 ^ =73 nM 

^l^X^ en2 - e -— — or be „t assay (ELISA) . 

by csa s :;: n :; tions were caicui ^ ed —3 as Predicted 

'Determined by pelleting cell8# resuspe ^ 
orxgxnal volume, breaking yeast „ ith glass beads Ld as 
resulting supernatant. assaying the 

n.d.=not done 
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Table 2. 
G PAl 41 -Gai2 



GPA1 4I -Gai3 



GPAl/?am-Gai2 



G PAl/?am - Gcvl 6 



GPAl/Jam-Gas 



Context- 
single copy, 
integrated, 
GPAl promoter 
single copy, 
integrated, 
GPAl promoter 

low copy plasmid, 
GPA l promoter 

low copy plasmid, 
GPAl vromoter 



low copy plasmid, 
GPAl promoter 



Result - 

Good signal to noise ratio: 
efficient coupling to yeast 

Poor.signai to noise ratio- 
high background due to poor 
coupling to yeast p y , high 
LIRMA*. 

Signal equal to that with 
GPAl 41 - G Q ! i2 < however, back- 
ground is greater. 
Poor signal to noise ratio, 
high background due to poor 
coupling to yeast (J y , high 
LIRMA*. 

Unacceptably high background 
due to poor coupling to 
y^st P' , high LIRMA*. 

media for strains containino w ' T"" 9r °" Ch ° n elective 
^crease lirma. Ic has be ^no ™ antagonists woula 
"hen that receptor is overexor,,- „ ad "nergic receptor 

identification of antagonists ™J£T , inClufli »9 the 

^ or antagonist! „ oul H °" 

conformation i„ such , way to affect the receptor 

=i 3 naUi„ 3 that occurs in the ab°-en 

exploited to iaentif, new G o J ° * 3 ° niBt ' LI "MA can be 
expressing cDNA clones L yeast s - r ™ "'"^ ^ 
O Proteins which couple onlt ^ , «>«— i»9 those chimeric 

P e»it y ™ <>?■ ^ aaaiti b „, 

•J»=i«c for G proteins °" ° f '""""or. that are 



o 



n 
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Table 3, . Coupling of G« switch region hybrids to the 
pheromone response pathway. ' 



Protein 


GPA1 amino 
acid sequences 


Gas amino acid 

w "J, w*i C £> 


Phenotype 


(JaS 


1-472 


none 


couples with 




none 


1-394 


Couples with 
G(3y weakly 


GPA 41 -S 
SGS 


1-41 


42-394 


couples with 
GjSy weakly 




297-33.3 


1-201 + 237-. 
394. 


Does not 
couple with 


UFA^-SGS 


1-41 + 297-333 


zt-ZVl + 237- 
394 


Couples with 
G(3y weakly 



We claim: 

1 . A mixture of recombinant cells, each cell of which comprises: - 

(i) an expressible recombinant gene encoding a heterologous receptor protein 
whose signal transduction activity is modulated by interaction with an 
extracellular signal and . 

(ii) an expressible recombinant gene encoding a heterologous potential receptor 
effector polypeptide, \ 

wherein collectively the mixture of cells expresses a variegated population of said 
receptor effector polypeptides, and modulation of the signal transduction activity of the 
receptor protein by a test polypeptide provides a detectable signal. 

2. A mixture of recombinant cells, ea\h cell of which comprises: 

(i) a heterologous receptor protein whose signal transduction activity is modulated 
by interaction with an extracellWr signals; 

(ii) an expressible recombinant gene encoding a heterologous potential receptor 
effector polypeptide; and ^\^- 

. (iii) a reporter gene construct containing a reporter gene in operative linkage with 
one or more transcriptional regulator elements responsive to the signal 
transduction acitivity of the receptor protein, 
wherein collectively the mixture of cells expresses a variegated population of test 
polypeptides as receptor effectors. \ 

3. The cells of claim 2, wherein the receptor is a nuclear receptor. 

4. The cells of claim 2, wherein the receptor is a cell surface receptor. : - 

5. A mixture of recombinant cells, each cell of which comprises: 

(i) a receptor protein whose signal transduction activity is modulated by 
interaction with an extracellular signals; \ 

(ii) an expressible recombinant gene encoding a heterologous potential receptor 
effector polypeptide; and \ 

(iii) a reporter gene construct containing a reporter gene inoperative linkage with 
one or more transcriptional regulatory elements responsive to the signal 
transduction acitivity of the receptor protein, \ 

wherein collectively the mixture of cells expresses a variegatecmwpulation of test 
polypeptides as receptor effectors. \ 



6. The cells of claim 5, wherein the receptor is a nuclear receptor. 

7. The cells of claim \, wherein the receptor is a cell surface receptor. 

8. A mixture of recombinant cells, each cell of which comprises: 

(i) a cell surface receptor protein whose signal transduction activity is modulated 
by interaction with>an extracellular signal; and 

(ii) an expressible recombinant gene encoding a heterologous potential receptor 
effector polypeptide including a signal sequence for secretion, 

wherein collectively the mixture of cells expresses a variegated population of test 
polypeptides as receptor effectors, anctanodulation of the signal transduction activity of the 
receptor protein by a test polypeptide provides a detectable signal. 

9. The recombinant cells of claim 8, wierein each cell further comprises a reporter 
gene construct containing a reporter gene in Vperative linkage with one ot more 
transcriptional regulatory elements responsiveVo the signal transduction acitivity of the cell 
surface receptor protein, expression of the repor^j>g^ne providing the detectable signal. 

10. The recombinant cells of claim 8, wherei»Vfre~reporter gene encodes a gene product 
that gives rise to a detectable signal selected from trie group consisting of: color, 
fluorescence, luminescence, cell viability relief of a oell nutritional requirement, cell 
growth, and drug resistance. 

11. The recombinant cells of claim 9, wherein the reporter gene encodes a gene product 
selected from the group consisting of chloramphenicol acdfyl transferase, beta-galactosidase 
and secreted alkaline phosphatase. 

12. The recombinant cells of claim 9, wherein the reportengene encodes a gene product 
which confers a growth signal. 

13. The recombinant cells of claim 9, wherein the reporter gene encodes a gene product 
for growth in media containing aminotriazole or canavanine. 

14. The recombinant cells of claim 8, wherein the detectable signa^ comprises 
intracellular calcium mobilization. 



15. The recombinant cells of claim 8, wherein the detectable signal comprises a 1 
significant change in intracellular protein phosphorylation. 



16. The recombinant cells of claim 8, wherein the detectable signal comprises increases 
in phospholipid metabolism. 

17. The recombinantaells of claim 8, wherein each cell further comprises a heterologous 
gene construct encoding tne receptor protein. 

18. The recombinant cell^f claim 8, wherein the receptor protein is a G-protein 
coupled receptor. \ 

19. The recombinant cells of craim 18, wherein the G-protein coupled receptor is 
selected from the group consisting \f: a chemoattractant peptide receptor, a neuropeptide 
receptor, a light receptor, a neurotransmitter receptor, a cyclic AMP receptor, and a 
polypeptide hormone receptor. \ 

2CL The recombinant cells of claim 8 wherein the receptor protein is. a receptor tyrosine 
kinase. )C \ 

21. The recombinant cells of claim 20, wheVein the receptor tyrosine kinase is an EPH 
receptor. \ 

22. The recombinant cells of claim 8, wherein tiae receptor protein is an orphan 
receptor. \ 

23. The recombinant cells of claim 8, which recombinant cells are yeast cells. 

24. The recombinant cells of claim 8, which recombinW cells are mammalian cellsT 

25. The recombinant cells of claim 8, wherein the variegated population of test 
polypeptides includes_at leasU 0j different test polypeptides. \ 

26. A recombinant cell, comprising: \ 

(i) an expressible recombinant gene encoding a heterologous cell surface receptor 
protein whose signal transduction activity is modulated xxy extracellular signals; 

(ii) an expressible recombinant gene encoding a heterologoukpotential receptor 
effector polypeptide including a signal sequence for secretion; and 

(iii) a reporter gene construct containing a reporter gene in operative linkage with 
one or more transcriptional regulatory elements responsive tAthe signal 



transduction acitivity of the cell surface receptor protein. 



27. The recombinant cell of claim 26, wherein the reporter gene encodes a gene product 
that gives rise to a detectable signal selected from the group consisting of: color, 
fluorescence, luminescence cell viability relief of a cell nutritional requirement, cell . 
growth, and drug resistance)^ 

28. The recombinant cell oL claim 26, wherein the receptor protein is a G-protein 
coupled receptor. \ 

29. The recombinant cell of claim 28, wherein the G-protein coupled receptor is. 
selected from the group consisting ok a chemoattractant peptide receptor, a neuropeptide 
receptor, a light receptor, a neurotransmitter receptor, a cyclic AMP receptor, and a 
polypeptide hormone receptor. \ 

30. The recombinant cell of claim 26, wherein the receptor protein is a receptor 
tyrosine kinase. )C\ 

31. The recombinant cell of claim 30, wher\in the receptor tyrosine kinase is an EPH 
receptor. \ 

32. The recombinant cell of claim 26, wherein the receptor protein is an orphan 
receptor. Y 

33. The recombinant cell of claim 26, wherein the rXceptor protein is a cytokine 
receptor. \ 

34. The recombinant cell of claim 26, wherein the receptor protein is an MIRR. 

35. , The recombinant cell of claim 26, which recombinant cell is a yeast cell. 

36. The recombinant cell of claim 35, which yeast cells is a Saicharomyces cell. 

37. The recombinant cell of claim 35, which yeast cells is a Schizosaccharomyces cell. 

38. . The recombinant cell of claim 26, which cells are mammalian qells. 

39. A mixture of recombinant cells, each cell of which comprises: \ 



(i) an ekpressible recombinant gene encoding a heterologous cell surface receptor 
protein whose signal transduction activity is modulated by extracellular signals; 

(ii) an expressible recombinant gene encoding a heterologous potential receptor 
effector\olypeptide including a signal sequence for secretion; and 

(iii) a reporter\ene construct containing a reporter gene in operative linkage with 
one or more\ranscriptional regulatory elements responsive to the signal 
transduction acitivity of the cell surface receptor protein, 

wherein collectively the mixture of cells expresses a variegated population of test 
polypeptides. \ 

40. The recombinant cells of Maim 39, wherein the receptor protein is a G-protein 
coupled receptor. \ 

41. The recombinant cells of claiiA40, wherein the G-protein coupled receptor is 
selected from the group consisting of: kchemoattractant peptide receptor, a neuropeptide 
receptor, a light receptor, a neurotransmitter receptor, a cyclic AMP receptor, and a 
polypeptide hormone receptor. 

42. The recombinant ceil of claim 40, whetein the G-protein coupled receptor is selected 
from the group consisting of: al A-adrenergic receptor, alB-adrenergic receptor, ct2- 
adrenergic receptor, a2B-adrenergic receptor, p l\adrenergic receptor, p2- adrenergic 
receptor, p3-adrenergic receptor, ml acetylcholineVeceptor (AChR), m2 AChR, m3 AChR, 
m4 AChR, m5 AChR, Dl dopamine receptor, D2 diamine receptor, D3 dopamine receptor, 
D4 dopamine receptor, D5 dopamine receptor, Al adenosine receptor, A2b adenosine 
receptor, 5-HTla, 5-HTlb, SHTl-like, 5-HTld, 5HTld\ike, 5HTld beta, substance K 
(neurokinin A), fMLP receptor, fMLP-like receptor, angiotensin II type 1, endothelin ETA, 
endothelin ETB, thrombin, growth hormone-releasing horrtxme (GHRH), vasoactive 
intestinal peptide, oxytocin, somatostatin SSTR1 and SSTRa SSTR3, cannabinoid, follicle 
stimulating hormone (FSH), leutropin (LH/HCG), thyroid stimulating hormone (TSH), 
thromboxane A2, platelet-activating factor (PAF), C5a anaphylWin, Interleukin 8 (IL-8) 
IL-8RA, IL-8RB, Delta Opioid, Kappa Opioid, mrp-l/RANTEs\Rhodopsin, Red opsin, 
Green opsin, Blue opsin, metabotropic glutamate mGluRl-6, histamine H2, ATP, 
neuropeptide Y, amyloid protein precursor, insulin-like growth factor II, bradykinin, 
gonadotropin-releasing hormone, cholecystokinin, melanocyte stimmating hormone receptor, 
antidiuretic hormone receptor, glucagon receptor, and adrenocorticotropic hormone II. 

43. The recombinant cells of claim 39, wherein the receptor proteto is a receptor 
tyrosine kinase. \ 



44. . The recombinantYells of claim 43, wherein the receptor tyrosine kinase is an EPH 
receptor. " 

45. The yeast cell of claim 44, wherein the receptor is selected from the group consisting 
of: eph, elk, eck, sek, mek4 y he\ hek2, eek, erk, tyrol y tyro4 y tyroS, tyro6, tyroll, cek4 9 cek5 > 
cek6, cek7, cek8 y cek9,ceklO y ofy y rtkl, rtk2, rtk3, mykl, myk2 9 ehkl> ehk2,pagliaccio 9 htk, 
erk and nuk receptors. 

46. The recombinant cell of claiip 39, wherein the receptor protein is a cytokine 
receptor. 

47. The recombinant cell of claim 3V> wherein the receptor protein is an MIRR 
receptor. 

48. The recombinant cell of claim 39, ^rff ein the receptor protein is an orphan 
receptor. 

49. The recombinant cell of claim 39, whiMi recombinant cell is a yeast cell. 

50. The recombinant cell of claim 49, whichWeast cells is a Saccharomyces cell. 

51. The recombinant cell of claim 49, which yiast cells is a Schizosaccharomyces cell. 

52. The recombinant cell of claim 39, which cell! are mammalian cells. 

53. The recombinant cells of claim 39, wherein the Variegated population of test* 
polypeptides includes at least 10 3 different test polypeptides. 



54. A method for identifying-potential receptor effector^ comprising: 

(i) providing a mixture of recombinant cells, each Veil of which comprises 

(a) a receptor protein whose signed transduction activity is modulated by 
interaction with an extracellular signal, and* 

(b) an expressible recombinant gene encoding a heterologous test polypeptide, 
wherein the mixture of cells collectively express a variegated population 
of test polypeptides, and modulation of the signal transduction activity of 
the receptor protein by a test polypeptide provides a detection signal; and 

(ii) isolating cells from the mixture which exhibit the detection signal. 



- -Hi - 

55. The method taf claim 54, wherein the cell receptor is a cell surface receptor. 

56. The method oficlaim 55, wherein the heterologous test polypeptide includes a signal 
sequence for secretion.^ 

57. The method of claim 54, wherein each cell of the mixture further comprises a 
reporter gene construct containing a reporter gene in operative linkage with one or more 
transcriptional regulatory elements responsive to the signal transduction acitivity of the cell 
surface receptor protein, expression of the reporter gene providing the detection signal. 

58. The method of claim 5A wherein the reporter gene encodes a gene product that 
gives rise to a detection signal selected from the group consisting of: color, fluorescence, 
luminescence', cell viability relief tof a cell nutritional requirement, cell growth, and drug 



resistance. 



59. The method of claim 58, whferidn the reporter gene encodes a gene product selected 
from the group consisting of chloramppicol acetyl transferase, beta-galactosidase and 
secreted alkaline phosphatase. 

60. The method of claim 58, wherein ttt^ reporter gene encodes a gene product which 
confers a growth signal. 

61. The method of claim 58, wherein the rdporter gene encodes a gene product for 
growth in media containing aminotriazole or cahavanine. 

62. The method of claim 54, wherein the detection signal comprises intracellular 
calcium mobilization. 



63. The method of claim 54, wherein the detectionWnal comprises a statistically 
significant change in intracellular protein phosphorylation. 

64. The method of claim 54, wherein the detection sigkal comprises changes in 
phospholipid metabolism. 



65. The method of claim 54, wherein each cell of the mi; 
heterologous gene construct encoding the receptor protein. 



lure further comprises a 




66. The method\of claim 54, wherein the receptor protein is a G-protein coupled 
receptor. 

67. The method of claim 66, wherein the G-protein coupled receptor is selected from 
the group consisting of: a\hemoattractant peptide receptor, a neuropeptide receptor, a 
light receptor, a neurotransmitter receptor, a cyclic AMP receptor, and a polypeptide 
hormone receptor. 

68. The method of claim 54Vherein the receptor protein is a receptor tyrosine kinase. 

69. The method of claim 68, wherein the receptor tyrosine kinase is an EPH receptor. 

70. The method of claim 54, wher\in the receptor protein is a cytokine receptor. 

71. The method of claim 54, wher A^he receptor protein is an orphan receptor. 

72. The method of claim 54, which recombinant cells are yeast cells. 

73. The method of claim 54, which recombinant cells are mammalian cells. 

74. The method of claim 54, wherein the varVegated population of test polypeptides 
includes at least 10 3 different test polypeptides. 



75. A method for identifying effectors of a cell surface receptor comprising: 

(i) providing a mixture of recombinant cells\ each cell of which comprises 

(a) an expressible recombinant gene encoding a heterologous cell surface 
receptor protein whose signal transduction activity is modulated by 

extracellular signals, 

(b) an expressible recombinant gene_encodiW a heterologous potential 

receptor effector polypeptide including a^ignal sequence for secretion, 
and 

(c) a reporter gene construct containing a reporter gene in operative linkage 
with one or more transcriptional regulatory elements responsive to the 
signal transduction acitivity of the cell surfacdreceptor protein, 

wherein the mixture of cells collectively express a vWiegated population of test 
polypeptides, and modulation of the signal transduction activity of the receptor 
protein by a test polypeptide causes a statistically significant change in the level 
nf the reporter Rene; and 



(ii) isolating cells from the mixture which exhibit the detection signal. 



76. A method for identifying ligands for\ 
(i) providing a mixture of recombii? 
(a) a heterologous gene encj 
signal transduction actH 



. orphan cell surface receptor comprising: 
it cells, each cell of which comprises 
an orphan cell surface receptor whose 
; modulated by extracellular signals; and 



(ii) 



(b) an expressible recombinant gene encoding a heterologous test 
polypeptide including a signal sequence for secretion, 
wherein the mixture of cells collectively express a variegated population of test 
polypeptides, and modulation of the signal transduction activity of the orphan 
receptor protein by a test polypeptide provides a detection signal; and 
isolating cells from the mixture which exhibit the detection signal. 




Abstract 



The present invention makes availabie a rapid, effective assay for screening and 

ine preseiu mvw cnwificallv interact with and 

for identifying surrogate ligands for orphan receptors. 
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